METHODS FOR ANALYSIS OF GENE EXPRESSION

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to and benefit of U.S. application
60/179,006, filed January 28, 2000, the full disclosure of which is incorporated herein by
reference.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY 
SPONSORED RESEARCH AND DEVELOPMENT 
The United States government may own rights in the present invention
pursuant to grant numbers HG01700-02, R43-CA83382 and N43-ES-81006 from the
National Institutes of Health.

BACKGROUND OF THE INVENTION 
Functional genomics is a rapidly growing area of investigation, which
includes research into genetic regulation and expression, analysis of mutations that cause
changes in gene function, and development of experimental and computational methods
for nucleic acid and protein analyses. The Human Genome Project has been the major
catalyst driving this research; it has been through the development of high-throughput
technologies that it has been possible to map and sequence complex genomes. However,
while the nucleic acid sequence information elicited by these technologies represents the
"structural" aspects of the genome, it is the interworkings of the genes encoded therein,
and the gene products derived from these sequences, that will give a meaningful context
to this information. In particular, gene expression monitoring can be utilized to examine
groups of related genes, interlocking biochemical pathways, and biological networks as a
whole.

This rapidly growing set of cloned human genes provides a plethora of 
candidate drug targets for testing against complex chemical libraries. In order to 
efficiently test the impact(s) of a large number of putative drug compounds on the 
expression profile of one or more sets of genes, methods are needed that are sensitive, 
quantitative, extremely rapid, and adaptable to automation, in order to be cost-effective.
Present day technologies do not meet these demands. The present invention addresses 
this need by providing novel methods for analyzing gene expression, systems for 
implementing these techniques, compositions for preparing a plurality of amplification 
products from a plurality of mRNA target sequences, and related pools of amplification
products.

SUMMARY OF THE INVENTION 

The present invention provides methods for analyzing gene expression.
The methods include obtaining a plurality of cDNA target sequences, and multiplex
amplifying these sequences, a process which involves combining the plurality of target
sequences with a plurality of target-specific primers and one or more universal primers, to
produce a plurality of amplification products. The target sequences are obtained in any of
a number of manners, such as by performing reverse transcription on a set of mRNA
molecules. The mRNA molecules are optionally derived from cells, organisms, or cell
cultures, which are optionally exposed to one or more specific treatments that potentially
alter the biological state of the cell, organism, or cell culture.

Target-specific primers for use in the methods of the present invention
include oligonucleotides comprising a first sequence that is derived from a target gene of
interest and positioned within a 3' region of the oligonucleotide, and a second sequence
that is complementary to a universal primer and positioned within the 5' region of the
oligonucleotide. The target-specific primers can be categorized as forward primers or
reverse primers, depending upon the orientation of the primer relative to the
polarity of the nucleic acid sequence (e.g., whether the primer binds to the coding strand
or a complementary (noncoding) strand of the target sequence).

The universal primers used in the methods of the present invention are
sequences common to a plurality of target-specific primers, but preferably not present in
the template nucleic acid (i.e., the plurality of target sequences). As such, a universal
primer typically does not hybridize to the target sequence template during a PCR reaction.
However, since the universal primer sequence is complementary to a portion of one or
more target-specific primers used in the present invention, the universal primer can
initiate polymerization using a target-specific primer-amplified product as a template. In
some embodiments of the present invention, multiple universal primers having sequences
distinct from one another are utilized; these universal primers are then called
"semi-universal" primers. As one example, a plurality of semi-universal primers can include
primer sequences that are complementary to one or more forward target-specific primers,
one or more reverse target-specific primers, or a combination thereof.
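
The behavior described above, in which a universal primer cannot anneal to the original template but can prime on a product generated with target-specific primers, can be illustrated with a small string-based sketch. All sequences are hypothetical, annealing is modeled as an exact full-length match, and a single universal primer is used for both tails for brevity. The sketch also assumes that each target-specific primer carries the universal sequence itself as its 5' tail, so that the universal primer's binding site first appears when that tail is copied into the opposite strand; this is one common reading of tailed-primer designs and is not spelled out in the text.

```python
# Hypothetical sequences, all written 5'->3'; not taken from the specification.
UNIVERSAL = "GTCAGCTAGGTCCAT"
TARGET = "ATGGCCTTAAGC" + "TTTTGGGGCCCCAAAA" + "CGATTACGGATC"  # coding strand
CORE_LEN = 12  # length of the target-specific 3' portion of each primer

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def can_anneal(primer: str, template: str) -> bool:
    """Exact-match model: the whole primer must pair with a site on the template."""
    return reverse_complement(primer) in template


def extend(primer: str, template: str, core_len: int = CORE_LEN) -> str:
    """Anneal the primer's 3' portion to the template and copy to the template's 5' end."""
    site = reverse_complement(primer[-core_len:])
    start = template.find(site)
    if start < 0:
        raise ValueError("primer 3' portion has no binding site on this template")
    # The new strand is the primer followed by a copy of everything 5' of the site.
    return primer + reverse_complement(template[:start])


# Tailed target-specific primers: universal sequence at the 5' end,
# target-derived sequence at the 3' end (assumed design, see lead-in).
forward_tsp = UNIVERSAL + TARGET[:CORE_LEN]
reverse_tsp = UNIVERSAL + reverse_complement(TARGET[-CORE_LEN:])

first_strand = extend(reverse_tsp, TARGET)         # primed on the original template
second_strand = extend(forward_tsp, first_strand)  # copies the reverse primer's tail

print(can_anneal(UNIVERSAL, TARGET))         # original template
print(can_anneal(UNIVERSAL, second_strand))  # primer-amplified product
```

In this toy model the binding site for the universal primer exists only in `second_strand`, which is why universal priming depends on at least one earlier round of target-specific priming.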
Optionally, the multiplex amplification process involves simultaneously
amplifying a plurality of cDNA molecules in the same reaction mixture. This can be
achieved, for example, by employing one or more target-specific primer pairs (where
each pair comprises a forward target-specific primer and a reverse target-specific primer)
and one or more universal primer pairs (also comprising pairs of forward and reverse
universal primers). In some embodiments of the present invention, the multiplex
amplification involves providing the universal primer in an excess concentration relative
to the target-specific primer.

In some embodiments of the methods of the present invention, the length
of one or more of the universal primers or target-specific primers is altered prior to
combination in the multiplex amplification step. This alteration in length can be
achieved, e.g., by adding nucleotides to the end of the primer sequence, inserting
nucleotides within the primer sequence, incorporating a non-nucleotide linker within the
primer sequence, or cleaving a cleavable linkage within the primer sequence. As one
example, alteration of the length of a target-specific primer is achieved by inserting
nucleotides between the universal sequence portion (i.e., that sequence complementary to
the universal primer sequence) and the target-specific sequence of the primer.
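
As a rough illustration of the last example, the sketch below builds a target-specific primer from a universal portion and a target-specific portion and lengthens it by inserting spacer nucleotides between the two. The sequences, the spacer base, and the function name are hypothetical choices made for the example, not values from the specification.

```python
def build_primer(universal_part: str, target_part: str,
                 spacer_len: int = 0, spacer_base: str = "T") -> str:
    """Return a primer (5'->3') with an optional spacer between its two portions."""
    if spacer_len < 0:
        raise ValueError("spacer_len must be non-negative")
    return universal_part + spacer_base * spacer_len + target_part


# Hypothetical primer portions.
universal_part = "GTCAGCTAGGTCCAT"
target_part = "ATGGCCTTAAGC"

original = build_primer(universal_part, target_part)
lengthened = build_primer(universal_part, target_part, spacer_len=6)

# The primer is incorporated into the product, so each inserted nucleotide
# lengthens the resulting amplification product by one nucleotide as well.
print(len(original), len(lengthened))
```

Because the spacer sits between the two functional portions, the 3' target-specific end and the 5' universal portion are left unchanged.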

One or more of the nucleic acid sequences used as universal primers and
target-specific primers in the methods of the present invention can optionally include a
cleavable linkage or a non-nucleotide linker as a sequence element. This non-nucleotide
linker can include, e.g., non-cleavable linkages, alkyl chains, or abasic nucleotides.
Furthermore, the nucleic acid sequences used as universal primers and target-specific
primers in the methods of the present invention can optionally include one or more labels.
Labels for use in the methods of the present invention can include, e.g., a chromophore, a
fluorophore, a dye, a releasable label, a mass label, an affinity label, a friction moiety, a
hydrophobic group, an isotopic label, or a combination thereof. The same label can be
incorporated into disparate primers used in a multiplexed amplification; alternatively,
unique labels or combinations of labels can be associated with each member of the
plurality of primers.

Furthermore, the multiplex amplification optionally includes a reference
sequence that contains a region homologous to at least one member of the plurality of
target-specific primers. The reference sequence (or sequences) can be endogenously
present in the cDNA containing the target sequence, or it can be exogenously added to the
cDNA sample.

One or more members of the plurality of amplification products are
separated by any of a variety of techniques known to those of skill in the art. In a
preferred embodiment of the present invention, the members are separated using one or
more separation techniques, such as mass spectrometry, electrophoresis (using, for
example, capillary electrophoresis, microcapillary electrophoresis, agarose and/or
acrylamide gel platforms), chromatography (e.g., HPLC or FPLC), or various
microfluidic techniques.

The one or more members are detected by any of a number of techniques,
thereby generating one or more sets of gene expression data. For example, in a preferred
embodiment, the amplification products are separated and detected by performing HPLC
followed by mass spectroscopy.

Detection is performed, for example, by measuring the presence, absence,
or quantity/amplitude of one or more properties of the amplification products. Example
properties of the amplification products include, but are not limited to, mass, light
absorption or emission, and one or more electrochemical properties. In embodiments in
which one or more of the primers includes a label, the inherent property can be dependent
upon the identity of the label. In one embodiment, detection of the amplification products
involves resolving a first signal from a singly labeled amplification product and a second
signal from a singly labeled (or multiply labeled) amplification product by deconvolution
of the data. In an alternative embodiment, detection of the amplification products
involves resolving a first signal from a singly labeled amplification product and a second
signal from a singly or multiply labeled amplification product by reciprocal subtraction of
the first or second signal from an overlapping signal. Thus, one or more amplification
products are detected and the information collected is used to generate a set of gene
expression data.
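
A minimal sketch of reciprocal subtraction follows, under the assumption that the overlapping signal is the point-by-point sum of two product signals and that one of them can be measured on its own (for example, in a separate label channel). The trace values, the `scale` parameter, and the function name are illustrative assumptions rather than details from the specification.

```python
def reciprocal_subtraction(overlapping, known, scale=1.0):
    """Subtract a separately measured signal from an overlapping trace.

    `scale` converts the known signal into the units of the overlapping
    trace; negative results are clipped to zero.
    """
    if len(overlapping) != len(known):
        raise ValueError("traces must have the same number of points")
    return [max(o - scale * k, 0.0) for o, k in zip(overlapping, known)]


# Hypothetical intensity traces sampled at the same positions.
overlapping_trace = [0.0, 5.0, 12.0, 5.0, 0.0]  # both products together
second_signal = [0.0, 2.0, 4.0, 2.0, 0.0]       # second product alone

first_signal = reciprocal_subtraction(overlapping_trace, second_signal)
print(first_signal)
```

The same function can be applied in the other direction: subtracting the recovered first signal from the overlapping trace returns the second signal.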

The set of gene expression data is stored in a database; this data is then
used, e.g., to perform a comparative analysis (for example, by measuring a ratio of each
target gene to each reference gene or other analysis of interest).
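
The comparative analysis mentioned above can be sketched as a ratio of each target gene signal to each reference gene signal. The gene names and signal values are hypothetical, and plain dictionaries stand in for the database.

```python
def target_to_reference_ratios(targets, references):
    """Return {(target, reference): ratio} for every target/reference pair."""
    ratios = {}
    for target_name, target_signal in targets.items():
        for ref_name, ref_signal in references.items():
            if ref_signal <= 0:
                raise ValueError(f"reference {ref_name!r} has no usable signal")
            ratios[(target_name, ref_name)] = target_signal / ref_signal
    return ratios


# Hypothetical detected signals (arbitrary units).
targets = {"geneA": 200.0, "geneB": 50.0}
references = {"ref1": 100.0, "ref2": 400.0}

ratios = target_to_reference_ratios(targets, references)
for pair, ratio in ratios.items():
    print(pair, ratio)
```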

The present invention also provides methods for analyzing gene 
expression including the steps of obtaining cDNA from a plurality of samples for a 
plurality of target sequences; performing a plurality of multiplexed amplifications of the
target sequences, thereby producing a plurality of multiplexed amplification products;
pooling the plurality of multiplexed amplification products; separating the plurality of 
multiplexed amplification products; detecting the plurality of multiplexed amplification 
products, thereby generating a set of gene expression data; storing the set of gene 
expression data in a database; and performing a comparative analysis of the set of gene 
expression data. As in the previous embodiments, a plurality of target-specific primers 
and universal primers are employed in the multiplexed amplification step. Either the 
universal primer(s) or the target-specific primer(s) can be labeled. In one embodiment of 
these methods, a first multiplexed amplification is performed using a primer having a first 
label that produces a first signal, and a second multiplexed amplification is performed 
with a primer comprising a second label that produces a second signal, wherein the first 
and second signals are distinguishable from one another. 

In another embodiment, the plurality of amplification products are detected 
by shifting the mobility of member amplification products relative to one another. For
example, amplification of the target sequences is performed using universal primers 
having two or more lengths; detection of the plurality of multiplexed amplification 
products produced using these primers involves measuring one or more size shifts among 
the plurality of multiplexed amplification products. Alternatively, the method is 
performed using target-specific primers having two or more lengths, leading to generation 
of differentially-sized amplification products. The shift in size can be achieved, for 
example, by using primers having cleavable linkages incorporated into their sequences. 
Alternatively, the shift in size can be achieved by incorporation of a friction moiety into 
one or more of the universal primers, thereby creating a reduction in mobility of the 
amplification products. 
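
To illustrate why a size shift helps when multiplexes are pooled, the sketch below compares two hypothetical multiplex sets whose products have the same lengths. Amplifying the second set with longer primers offsets its products so that the pooled peaks no longer coincide. The product sizes, the shift, and the minimum resolvable gap are assumed values.

```python
def shift_sizes(sizes, shift):
    """Product sizes after amplification with primers that add `shift` nucleotides."""
    return [size + shift for size in sizes]


def is_resolvable(pooled_sizes, min_gap):
    """True if every pair of neighboring products differs by at least `min_gap`."""
    ordered = sorted(pooled_sizes)
    return all(b - a >= min_gap for a, b in zip(ordered, ordered[1:]))


# Hypothetical product sizes (nucleotides) for two multiplex sets.
set_one = [100, 120, 140]
set_two = [100, 120, 140]

unshifted_pool = set_one + set_two
shifted_pool = set_one + shift_sizes(set_two, shift=10)

print(is_resolvable(unshifted_pool, min_gap=3))
print(is_resolvable(shifted_pool, min_gap=3))
```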

The multiplex amplification reaction used in the methods of the present 
invention includes, but is not limited to, a polymerase chain reaction, a transcription- 
based amplification, a self-sustained sequence replication, a nucleic acid sequence based 
amplification, a ligase chain reaction, a ligase detection reaction, a strand displacement 
amplification, a repair chain reaction, a cyclic probe reaction, a rapid amplification of 
cDNA ends, an invader assay, a bridge amplification or rolling circle amplification, or a 
combination thereof. 

The present invention also provides methods for analyzing gene 
expression including the steps of obtaining cDNA from multiple samples; amplifying a 
plurality of target sequences from the cDNA, thereby producing a multiplex of
amplification products; separating and detecting the amplification products using a high
throughput platform, wherein detecting generates a set of gene expression data; storing
the set of gene expression data in a database; and performing a comparative analysis of
the set of gene expression data.

The methods of the present invention optionally include performing one or
more of the amplifying, separating or detecting steps in a high throughput format. For
example, the reactions can be performed in multi-well plates. Optionally, anywhere
between about 96 and about 5000 reactions, preferably between about 500 and 2000
reactions, and more preferably about 1000 reactions, are performed per hour using the
methods of the present invention. Furthermore, one or more miniaturized scale platforms
can be used to perform the methods of the present invention.

The present invention also provides systems for analyzing gene
expression. The elements of the system include, but are not limited to, a) an amplification
module for producing a plurality of amplification products from a pool of target
sequences; b) a detection module for detecting one or more members of the plurality of
amplification products and generating a set of gene expression data comprising a plurality
of data points; and c) an analyzing module in operational communication with the
detection module, the analyzing module comprising a computer or computer-readable
medium comprising one or more logical instructions which organize the plurality of data
points into a database and one or more logical instructions which analyze the plurality of
data points. Any or all of these modules can comprise high throughput technologies
and/or systems.

The amplification module of the present invention includes at least one
pair of universal primers and at least one pair of target-specific primers for use in the
amplification process. Optionally, the amplification module includes a unique pair of
universal primers for each target sequence. Furthermore, the amplification module can
include components to perform one or more of the following reactions: a polymerase
chain reaction, a transcription-based amplification, a self-sustained sequence replication,
a nucleic acid sequence based amplification, a ligase chain reaction, a ligase detection
reaction, a strand displacement amplification, a repair chain reaction, a cyclic probe
reaction, a rapid amplification of cDNA ends, an invader assay, or various solution phase
and/or solid phase assays (for example, bridge amplification or rolling circle
amplification). The detection module can include systems for implementing separation of
the amplification products; exemplary detection modules include, but are not limited to,
mass spectrometry instrumentation and electrophoretic devices.

The analyzing module of the system includes one or more logical
instructions for analyzing the plurality of data points generated by the detection system.
For example, the instructions can include software for performing difference analysis
upon the plurality of data points. Additionally (or alternatively), the instructions can
include or be embodied in software for generating a graphical representation of the
plurality of data points. Optionally, the instructions can be embodied in system software
which performs combinatorial analysis on the plurality of data points.
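
One possible form of difference analysis is sketched below: the log2 ratio of treated to control signal is computed for each gene, and genes whose change meets a threshold are reported. This interpretation of "difference analysis", the threshold, and the data values are assumptions made for illustration; the text does not prescribe a particular calculation. Signals are assumed to be positive.

```python
import math


def difference_analysis(control, treated, min_abs_log2=1.0):
    """Return {gene: log2(treated/control)} for genes changed by at least the threshold."""
    changed = {}
    # Only genes measured in both samples can be compared.
    for gene in control.keys() & treated.keys():
        log2_ratio = math.log2(treated[gene] / control[gene])
        if abs(log2_ratio) >= min_abs_log2:
            changed[gene] = log2_ratio
    return changed


# Hypothetical data points (arbitrary signal units) from two samples.
control = {"g1": 100.0, "g2": 100.0, "g3": 100.0}
treated = {"g1": 400.0, "g2": 100.0, "g3": 25.0}

changed_genes = difference_analysis(control, treated)
print(changed_genes)
```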

The present invention also provides kits for obtaining a multiplex set of
amplification products of target genes and reference genes. The kits of the present
invention include a) at least one pair of universal primers; b) at least one pair of
target-specific primers; c) at least one pair of reference gene-specific primers; and d) one or
more amplification reaction enzymes, reagents, or buffers. The kits optionally further
include software for storing and analyzing data obtained from the amplification reactions.

Additionally, the present invention provides compositions for preparing a 
plurality of amplification products from a plurality of mRNA target sequences. The
compositions include one or more pairs of universal primers; and one or more pairs of
target-specific primers. The present invention also provides for the use of the kits of the 
present invention for practicing any of the methods of the present invention, as well as the 
use of a composition or kit as provided by the present invention for practicing a method 
of the present invention. Furthermore, the present invention provides assays utilizing any 
of these uses. 

BRIEF DESCRIPTION OF THE FIGURES

The following drawings form part of the present specification and are
included to further demonstrate certain aspects of the present invention. The invention
may be better understood by reference to one or more of these drawings in combination
with the detailed description of specific embodiments presented herein.

FIG. 1: Schematic of one embodiment of a set of target-specific primers
and a universal primer employed in the present invention. The abbreviation "TSP"
indicates a target-specific primer, while "UP" indicates a universal primer. Different line
patterns (bold, dashed, etc.) symbolize different DNA sequences.

wo 01/55454 PCT/US01/027S7 
FIG. 2: Schematic drawing depicting coupled target-specific and universal
priming of a PCR reaction.

FIG. 3: Schematic depiction of exemplary reactions occurring in a 
multiplexed reverse transcriptase-based polymerase chain reaction (RT-PCR),
using a combination of target-specific and universal primers.

FIG. 4: Exemplary profiles of original and "shifted" multiplex gene sets. 

FIG. 5: Exemplary profiles of multiplex gene sets using multiple
fluorescent dye labels. 

DETAILED DISCUSSION 

DEFINITIONS

Before describing the present invention in detail, it is to be understood that
this invention is not limited to particular compositions or biological systems, which can,
of course, vary. It is also to be understood that the terminology used herein is for the
purpose of describing particular embodiments only, and is not intended to be limiting. As
used in this specification and the appended claims, the singular forms "a", "an" and "the"
include plural referents unless the content clearly dictates otherwise. Thus, for example,
reference to "a device" includes a combination of two or more such devices, reference to
"a gene fusion construct" includes mixtures of constructs, and the like.

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to 
which the invention pertains. Although any methods and materials similar or equivalent 
to those described herein can be used in the practice or testing of the present invention,
currently preferred materials and methods are described herein.

In describing and claiming the present invention, the following 
terminology will be used in accordance with the definitions set out below. 

The term "absolute abundance" or "absolute gene expression levels" refers 
to the amount of a particular species (e.g., gene expression product) present in a sample. 

The term "amplified product" refers to a nucleic acid generated by any
method of nucleic acid amplification. 

The term "attenuation" refers to a method of reducing the signal intensities 
of extremely abundant reaction products in a multiplex, such that the signals from all
products of a multiplex set of products fall within the dynamic range of the detection
platform used for the assay. 
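
As a simple numerical illustration of this definition, the sketch below computes the factor by which each product's signal would need to be reduced so that all signals fit under a detector ceiling. The signal values and the ceiling are hypothetical, and the sketch says nothing about how the reduction is achieved experimentally.

```python
def attenuation_factors(signals, max_signal):
    """Return {product: factor}, where factor <= 1 scales the signal into range."""
    if max_signal <= 0:
        raise ValueError("max_signal must be positive")
    return {
        name: min(1.0, max_signal / value) if value > 0 else 1.0
        for name, value in signals.items()
    }


# Hypothetical peak signals and detector ceiling (arbitrary units).
signals = {"abundant": 50000.0, "moderate": 8000.0, "rare": 300.0}
factors = attenuation_factors(signals, max_signal=10000.0)

for name, factor in factors.items():
    print(name, factor, signals[name] * factor)
```

Only the product that exceeds the ceiling receives a factor below 1; products already within range are left unchanged.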

The term "blocking group" refers to a chemical modification at the 3' end
of an amplification primer that does not interfere with hybridization between the primer 
and its target sequence, but cannot be extended by a DNA polymerase. 

The term "cDNA" refers to complementary or "copy" DNA. Generally,
cDNA is synthesized by a DNA polymerase using any type of RNA molecule (e.g., 
typically mRNA) as a template. Alternatively, the cDNA can be obtained by directed 
chemical syntheses. 

The term "chemical treatment" refers to the process of exposing a cell, cell 
line, tissue or organism to a chemical or biochemical compound (or library of 
compounds) that has/have the potential to alter its gene expression profile. 

The term "complementary" refers to nucleic acid sequences capable of
base-pairing according to the standard Watson-Crick complementarity rules, or being
capable of hybridizing to a particular nucleic acid segment under relatively stringent
conditions. Nucleic acid polymers are optionally complementary across only portions of
their entire sequences.

The term "environmental stress" refers to an externally applied factor or 
condition that may cause an alteration in the gene expression profile of a cell. 

The term "friction group" refers to a chemical or physical moiety attached
to a nucleic acid for the purposes of reducing the mobility by frictional drag of that
nucleic acid in a matrix or fluid across which an electric field is applied.

The term "gene" refers to a nucleic acid sequence encoding a gene
product. The gene optionally comprises sequence information required for expression of
the gene (e.g., promoters, enhancers, etc.).

The term "gene expression" refers to transcription of a gene into an RNA
product, and optionally to translation into one or more polypeptide sequences. 

The term "gene expression data" refers to one or more sets of data that
contain information regarding different aspects of gene expression. The data set
optionally includes information regarding: the presence of target transcripts in cell or
cell-derived samples; the relative and absolute abundance levels of target transcripts; the 
ability of various treatments to induce expression of specific genes; and the ability of 
various treatments to change expression of specific genes to different levels. 

The term "high throughput format" refers to analyzing more than about 10
samples per hour, preferably about 50 or more samples per hour, more preferably about
100 or more samples per hour, most preferably about 250, about 500, about 1000 or more
samples per hour.
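
The tiers in this definition can be expressed as a small lookup. The sketch treats each "about" value as an exact cutoff, which is a simplification, and the tier labels are paraphrases rather than terms defined in the text.

```python
def throughput_tier(samples_per_hour: float) -> str:
    """Classify a sample rate against the tiers in the definition above."""
    if samples_per_hour >= 250:
        return "most preferred"
    if samples_per_hour >= 100:
        return "more preferred"
    if samples_per_hour >= 50:
        return "preferred"
    if samples_per_hour > 10:
        return "high throughput"
    return "not high throughput"


# Hypothetical sample rates.
for rate in (8, 24, 96, 384, 1000):
    print(rate, throughput_tier(rate))
```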

The term "hybridization" refers to duplex formation between two or more
polynucleotides, e.g., to form a double-stranded nucleic acid. The ability of two regions
of complementarity to hybridize and remain together depends on the length and continuity
of the complementary regions, and the stringency of hybridization conditions.

The term "label" refers to any detectable moiety. A label may be used to 
distinguish a particular nucleic acid from others that are unlabeled, or labeled differently,
or the label may be used to enhance detection. 

The terms "microplate," "culture plate," and "multiwell plate"
interchangeably refer to a surface having multiple chambers, receptacles or containers and
generally used to perform a large number of discrete reactions simultaneously.

The term "miniaturized format" refers to procedures or methods conducted 
at submicroliter volumes, including on both microfluidic and nanofluidic platforms.

The term "multiplex reaction" refers to a plurality of reactions conducted
simultaneously in a single reaction mixture. 

The term "multiplex amplification" refers to a plurality of amplification 
reactions conducted simultaneously in a single reaction mixture. 

The term "nucleic acid" refers to a polymer of ribonucleic acids or 
deoxyribonucleic acids, including RNA, mRNA, rRNA, tRNA, small nuclear RNAs,
cDNA, DNA, PNA, or RNA/DNA copolymers. Nucleic acid may be obtained from a
cellular extract, genomic or extragenomic DNA, viral RNA or DNA, or 
artificially/chemically synthesized molecules. 

The term "platform" refers to the instrumentation method used for sample
preparation, amplification, product separation, product detection, or analysis of data
obtained from samples.

The term "primer" refers to any nucleic acid that is capable of hybridizing
at its 3' end to a complementary nucleic acid molecule, and that provides a free 3'
hydroxyl terminus which can be extended by a nucleic acid polymerase.

The term "reference sequence" refers to a nucleic acid sequence serving as 
a target of amplification in a sample that provides a control for the assay. The reference
may be internal (or endogenous) to the sample source, or it may be externally added
(or exogenous) to the sample. An external reference may be either RNA, added to the
sample prior to reverse transcription, or DNA (e.g., cDNA), added prior to PCR
amplification.

The term "relative abundance" or "relative gene expression levels" refers
to the abundance of a given species relative to that of a second species. Optionally, the
second species is a reference sequence.

The term "RNA" refers to a polymer of ribonucleic acids, including RNA,
mRNA, rRNA, tRNA, and small nuclear RNAs, as well as to RNAs that comprise
ribonucleotide analogues to natural ribonucleic acid residues, such as 2'-O-methylated
residues.

The term "semi-universal primer" refers to a primer that is capable of
hybridizing with more than one, but not all, of the target-specific primers in a multiplexed 
reaction. 

The term "separation system" refers to any of a set of methodologies that
can be employed to effect a size separation of the products of a reaction.

The term "size separation" refers to physical separation of a complex 
mixture of species into individual components according to the size of each species. 

The term "target," "target sequence," or "target gene sequence" refers to a 
specific nucleic acid sequence, the presence, absence or abundance of which is to be
determined. In a preferred embodiment of the invention, it is a unique sequence within
the mRNA of an expressed gene. 

The term "target-specific primer" refers to a primer capable of hybridizing 
with its corresponding target sequence. Under appropriate conditions, the hybridized 
primer can prime the replication of the target sequence.

The term "template" refers to any nucleic acid polymer that can serve as a 
sequence that can be copied into a complementary sequence by the action of, for example,
a polymerase enzyme. 

The term "transcription" refers to the process of copying a DNA sequence 
of a gene into an RNA product, generally conducted by a DNA-directed RNA polymerase
using the DNA as a template. 

The term "treatment" refers to the process of subjecting one or more cells, 
cell lines, tissues, or organisms to a condition, substance, or agent (or combinations 
thereof) that may cause the cell, cell line, tissue or organism to alter its gene expression
profile. A treatment may include a range of chemical concentrations and exposure times,
and replicate samples may be generated.

The term "universal primer" refers to a replication primer comprising a 
universal sequence.

The term "universal sequence" refers to a sequence contained in a plurality 
of primers, but preferably not in a complement to the original template nucleic acid (e.g., 
the target sequence), such that a primer composed entirely of universal sequence is not 
capable of hybridizing with the template. 

GENE EXPRESSION AS A MEASURE OF THE BIOLOGICAL STATE OF A CELL
Transcription of genes into RNA is a critical early step in gene expression.
Consequently, the coordinated activation or suppression of transcription of particular
genes is an important component of the overall regulation of expression. A variety of
well-developed techniques have been established that provide ways to analyze and
quantitate gene transcription.

Some of the earliest methods are based on detection of a label in RNA
hybrids or protection of RNA from enzymatic degradation (see, for example, Current
Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint
venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.,
supplemented through 1999). Methods based on detecting hybrids include northern blots
and slot/dot blots. These two techniques differ in that the components of the sample
being analyzed are resolved by size in a northern blot prior to detection, which enables
identification of more than one species simultaneously. Slot blots are generally carried
out using unresolved mixtures of sequences, but can be easily performed in serial
dilution, enabling a more quantitative analysis. Both techniques are very time-consuming
and require a fair amount of manual manipulation, making them expensive and unsuitable
for high throughput applications.

In situ hybridization is a technique that monitors transcription by directly
visualizing RNA hybrids in the context of a whole cell. This method provides
information regarding subcellular localization of transcripts. However, it is not very
quantitative, and is extremely technically demanding and time-consuming. As a
consequence, this technique is best suited for basic research applications.

Techniques to monitor RNA that make use of protection from enzymatic
degradation include S1 analysis and RNAse protection assays (RPAs). Both of these
assays employ a labeled nucleic acid probe, which is hybridized to the RNA species being
analyzed, followed by enzymatic degradation of single-stranded regions of the probe.
Analysis of the amount and length of probe protected from degradation is used to
determine the quantity and endpoints of the transcripts being studied. Although both
methods can yield quantitative results, they are time-consuming and cumbersome, making
them poor candidates for a high-throughput, low cost general assay for gene expression.

A second family of assays developed for monitoring transcription makes
use of cDNA derived from mRNA. Because the material analyzed is DNA, these assays
are less sensitive to degradation, and also provide partial and/or full clones with which to
localize and clone genes or coding sequences of interest. Methods include sequencing
cDNA inserts of an expressed sequence tag (EST) clone library (Adams et al. (1991)
Science 252:1631-1656), which may be coupled with subtractive hybridization to
improve sensitivity (Sagerstrom et al. (1997) Annu. Rev. Biochem. 66:751-783), and
serial analysis of gene expression ("SAGE", described in U.S. Patent No. 5,866,330 to
Kinzler et al.; Velculescu et al. (1995) Science 270:484-487; and Zhang et al. (1997)
Science 276:1268-1272). Both of these methods have been useful for identification of
novel, differentially expressed genes. However, their methodologies yield untargeted
information, i.e., they survey the whole spectrum of mRNA in a sample rather than
focusing on a predetermined set. As a result, very large data sets are required to derive
reliable quantitative data, making these methods inappropriate and far too costly for high
throughput screening strategies.

Reverse transcriptase-mediated PCR (RT-PCR) gene expression assays are
directed at specified target gene products, overcoming some of the shortcomings
described above. These assays are derivatives of PCR in which amplification is preceded
by reverse transcription of mRNA into cDNA. Because the mRNA is amplified, this type
of assay can detect transcripts of very low abundance; however, the assay is not
quantitative. Adaptations of this assay, called competitive RT-PCR (Becker-Andre and
Hahlbrock (1989) Nucleic Acids Res. 17:9437-9446; Wang et al. (1989) Proc. Natl.
Acad. Sci. USA 86:9717-9721; Gilliland et al. (1990) Proc. Natl. Acad. Sci. USA
87:2725-2729), have been developed that are more quantitative. In these assays, a known
amount of exogenous template is added to the reaction mixture, to compete with the
target for amplification. The exogenous competitor is titrated against the target, allowing
for quantitation of a specified cDNA in the sample by comparing the amplification of
both templates within the same reaction mixture. Because titration is required to generate
quantitative data, multiple reactions are required for each analysis. While this type of
assay is very sensitive and quantitative, it requires multiple steps in
development, execution, and analysis, making it very time-consuming, cumbersome,
and expensive. The need to perform a titration reduces the overall throughput of the
assay, and the requirement for an internal competitor for each target reduces the
multiplexing capacity. These limitations restrict the usefulness of this assay in analysis of
large numbers of gene sets.
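
By way of illustration only, the titration logic of a competitive RT-PCR assay can be sketched in Python. The sketch assumes that the logarithm of the target-to-competitor product ratio varies linearly with the logarithm of the competitor amount, and it takes the equivalence point (a ratio of 1) as the estimate of the target amount. The function name, the least-squares fit, and the signal values are assumptions introduced for this example and are not taken from the cited references.

```python
import math


def estimate_target_amount(competitor_amounts, target_signals, competitor_signals):
    """Estimate the target amount from a competitive titration series.

    Fits log10(target signal / competitor signal) against
    log10(competitor amount) by ordinary least squares and returns the
    competitor amount at which the fitted ratio equals 1.
    """
    xs = [math.log10(c) for c in competitor_amounts]
    ys = [math.log10(t / s) for t, s in zip(target_signals, competitor_signals)]
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two titration points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("competitor amounts must differ between reactions")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    if slope == 0:
        raise ValueError("signal ratio does not respond to the titration")
    intercept = mean_y - slope * mean_x
    # Equivalence point: fitted log ratio is zero.
    return 10 ** (-intercept / slope)


# Hypothetical, idealized series: four reactions, each with a different
# known amount of competitor (arbitrary copy numbers and signal units).
competitor = [10, 100, 1000, 10000]
target_signal = [1000.0, 1000.0, 1000.0, 1000.0]
competitor_signal = [10.0, 100.0, 1000.0, 10000.0]
estimate = estimate_target_amount(competitor, target_signal, competitor_signal)
print(round(estimate))  # about 1000 copies for this idealized series
```

The example makes the throughput cost concrete: four separate reactions are consumed to quantitate a single target in a single sample.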

In order to increase the throughput of the RT-PCR assay, Su et al.
(BioTechniques (1997) 22:1107-1113) combined microplate-based RNA extraction with
multiplexed RT-PCR. With this method, they demonstrated simultaneous analysis of
three different target mRNAs amplified from samples prepared from a 96 well
microplate. However, changes in gene expression were only presented qualitatively.

Other methods for targeted mRNA analysis include differential display
reverse transcriptase PCR (DDRT-PCR) and RNA arbitrarily primed PCR (RAP-PCR)
(see U.S. Patent No. 5,599,672; Liang and Pardee (1992) Science 257:967-971; Welsh et
al. (1992) Nucleic Acids Res. 20:4965-4970). Both methods use random priming to
generate RT-PCR fingerprint profiles of transcripts in an unfractionated RNA
preparation. The signal generated in these types of analyses is a pattern of bands
separated on a sequencing gel. Differentially expressed genes appear as changes in the
fingerprint profiles between two samples, which can be loaded in separate wells of the
same gel. This type of readout allows identification of both up- and down-regulation of
genes in the same reaction, appearing as either an increase or decrease in intensity of a
band from one sample to another. However, due to the complexity of the fingerprint
profile, amplification products are strongly biased towards more abundant transcripts.
Simultaneous amplification of hundreds to thousands of different products dramatically
compresses the dynamic range of measurement. The combined result of amplification
bias, dynamic range compression and other biases that result from the use of a complex
mix of primers eliminates the ability to quantitate relative changes in expression between
the different genes in a sample. Furthermore, the methodology is designed for
identification of changes in the transcriptional profile of a whole cell, but does not
provide any information about the identities of the PCR products. To identify a species, a
band must be excised from the gel, subcloned, sequenced, and finally matched to a gene
in a sequence database. The complexity of the profile prohibits complete resolution of
PCR products on the gel, causing a high incidence of false positives arising from multiple
species existing in the same region of the gel. These characteristics make general
fingerprinting techniques unsuitable for investigation of already identified transcripts, and
preclude a high-throughput quantitative analysis.

The TaqMan assay (Livak et al. (1995) PCR Methods Appl. 4:357-362) is
a quenched fluorescent dye system for quantitating targeted mRNA levels in a complex
mixture. The assay has good sensitivity and dynamic range, and yields quantitative
results. But because detection is based on fluorescence of unfractionated products, it can
be multiplexed only to very low levels (i.e., two to four), as allowed by resolution of
emission spectra of the chromophores. Furthermore, due to overlapping emission spectra,
multiplexing reduces the accuracy of quantitation. This limitation makes differential
analysis problematic and increases the cost. Also, the assay is performed in real time
during thermal cycling, greatly reducing the throughput of the assay.

Nucleic acid microarrays have been developed recently, which have the
benefit of assaying for sample hybridization to a large number of probes in a highly
parallel fashion. They can be used for quantitation of mRNA expression levels, and
dramatically surpass the above mentioned techniques in terms of multiplexing capability.
These arrays comprise short DNA sequences, PCR products, or mRNA isolates fixed onto
a solid surface, which can then be used in a hybridization reaction with a target sample,
generally a whole cell extract (see, for example, U.S. Patent Nos. 5,143,854 and
5,807,522; Fodor et al. (1991) Science 251:767-773; and Schena et al. (1995) Science
270:467-470). Microarrays can be used to measure the expression levels of several
thousands of genes simultaneously, generating a gene expression profile of the entire
genome of relatively simple organisms. Each reaction, however, is performed with a
single sample against a very large number of gene probes. As a consequence, microarray
technology does not facilitate high throughput analysis of very large numbers of unique
samples against an array of known probes.

The present invention addresses the need for gene expression detection and
quantitation methodologies by providing novel methods for analyzing gene expression,
systems for implementing these techniques, compositions for preparing a plurality of
amplification products from a plurality of mRNA target sequences, and related pools of
amplification products. The methods of the present invention include the steps of (a)
obtaining a plurality of target cDNA sequences; (b) multiplex amplifying the target
sequences using a plurality of target-specific primers and one or more universal primers;
(c) separating one or more members of the resulting plurality of amplification products;
(d) detecting the one or more members of the plurality of amplification products, thereby
generating a set of gene expression data; (e) storing the data in a database; and (f)
performing a comparative analysis on the set of gene expression data, thereby analyzing
the gene expression. The methods of the invention are highly sensitive; have a wide
dynamic range; are rapid and inexpensive; have a high throughput; and allow the
simultaneous differential analysis of a defined set of genes. The methods, compositions
and kits of the invention also provide tools for gene expression data collection and
relational data analysis.

METHODS FOR QUANTITATING GENE EXPRESSION LEVELS

The controlled expression of particular genes or groups of genes in a cell is
the molecular basis for regulation of biological processes and, ultimately, for the
physiological or pathological state of the cell. Knowledge of the "expression profile" of a
cell is of key importance for answering many biological questions, including the nature
and mechanism of cellular changes, or the degree of differentiation of a cell, organ, or
organism. Furthermore, the factors involved in determining the expression profile may
lead to the discovery of cures that could reverse an adverse pathological or physiological
condition. A defined set of genes can be demonstrated to serve as indicators of a
particular state of a cell, and can therefore serve as a model for monitoring the cellular
profile of gene expression in that state.

The pharmaceutical drug discovery process has traditionally been
dominated by biochemical and enzymatic studies of a designated pathway. Although this
approach has been productive, it is very laborious and time-consuming, and is generally
targeted to a single gene or defined pathway. Molecular biology and the development of
gene cloning have dramatically expanded the number of genes that are potential drug
targets, and this process is accelerating rapidly as a result of the progress made in
sequencing the human genome. In addition to the growing set of available genes,
techniques such as the synthesis of combinatorial chemical libraries have created
daunting numbers of candidate drugs for screening. In order to capitalize on these
available materials, methods are needed that are capable of extremely fast and
inexpensive analysis of gene expression levels.

The present invention provides novel methods for the analysis of changes
in expression levels of a set of genes. These methods include providing a plurality of
target sequences, which are then analyzed simultaneously in a multiplexed reaction.
Multiplexing the analysis improves the accuracy of quantitation; for example, signals
from one or more target genes can be compared to an internal control. Multiplexing also
reduces the time and cost required for analysis. Thus, the methods of the present
invention provide for rapid generation of a differential expression profile of a defined set
of genes, through the comparison of data from multiple reactions.

The methods of the present invention include the steps of (a) obtaining a
plurality of target nucleic acid sequences, generally cDNA sequences; (b) multiplex
amplifying the target sequences using a plurality of target-specific primers and one or
more universal primers; (c) separating one or more members of the resulting plurality of
amplification products; (d) detecting the one or more members of the plurality of
amplification products, thereby generating a set of gene expression data; (e) storing the
data in a database; and (f) performing a comparative analysis on one or more components
of the set of gene expression data, thereby analyzing the gene expression. In an
alternative embodiment, the methods of the present invention include the steps of
obtaining cDNA from a plurality of samples for a plurality of target sequences;
performing a plurality of multiplexed amplifications of the target sequences, thereby
producing a plurality of multiplexed amplification products; pooling the plurality of
multiplexed amplification products; separating the plurality of multiplexed amplification
products; detecting the plurality of multiplexed amplification products, thereby generating
a set of gene expression data; storing the set of gene expression data in a database; and
performing a comparative analysis of the set of gene expression data. In yet another
embodiment, the methods of the present invention include the steps of (a) obtaining
cDNA from multiple samples; (b) amplifying a plurality of target sequences from the
cDNA, thereby producing a multiplex of amplification products; (c) separating and
detecting the amplification products using a high throughput platform, wherein detecting
generates a set of gene expression data; (d) storing the set of gene expression data in a
database; and (e) performing a comparative analysis of the set of gene expression data. In
a further embodiment, the present invention provides methods for analyzing gene
expression, including the steps of (a) obtaining cells, e.g., culturing one of several
designated cell lines; (b) optionally subjecting a set of the cultures to a specified
treatment; (c) lysing the cells and isolating one or more RNA molecules; (d) synthesizing
cDNA first strand molecules from a designated set of the mRNA molecules; (e)
quantitatively amplifying the resulting set of cDNA products using target-specific primers
in early rounds, coupled with amplifying the whole set by universal primers that have
partial homology with all of the target-specific primers, and that contain a detectable
label, preferably a fluorescent chromophore, on at least one of the primers; (f) optionally
pooling products of two or more separate reactions; (g) physically separating amplified
products according to their length; (h) detecting and quantitating the separated
amplification products, for example, by deconvolution of data from any species of the
same length (arising from reactions that were pooled); (i) determining the relative
abundance levels using an internal reference target; (j) storing the information in a gene
expression database; and (k) performing a comparative analysis of the expression
patterns. Each aspect of these methods of the present invention is addressed in greater
detail below.
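
The final two steps recited above, storing the data in a database and performing a comparative analysis, can be illustrated with a minimal Python sketch. It uses the standard-library sqlite3 module with an in-memory database. The table layout, the column names, the treatment labels, and the choice of a ratio of mean relative levels as the comparison are all assumptions made for this example; they are not a description of any particular database of the invention.

```python
import sqlite3


def build_expression_db(records):
    """Store (sample, treatment, gene, relative_level) rows in memory."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE expression ("
        "sample TEXT, treatment TEXT, gene TEXT, relative_level REAL)"
    )
    conn.executemany("INSERT INTO expression VALUES (?, ?, ?, ?)", records)
    conn.commit()
    return conn


def fold_changes(conn, control, treated):
    """Return {gene: mean treated level / mean control level}."""
    rows = conn.execute(
        "SELECT c.gene, t.mean_level / c.mean_level "
        "FROM (SELECT gene, AVG(relative_level) AS mean_level "
        "      FROM expression WHERE treatment = ? GROUP BY gene) AS c "
        "JOIN (SELECT gene, AVG(relative_level) AS mean_level "
        "      FROM expression WHERE treatment = ? GROUP BY gene) AS t "
        "ON c.gene = t.gene ORDER BY c.gene",
        (control, treated),
    ).fetchall()
    return dict(rows)


# Hypothetical relative expression levels (already normalized to a reference).
example_records = [
    ("well_A1", "vehicle", "GENE_A", 1.0),
    ("well_A2", "vehicle", "GENE_A", 1.0),
    ("well_B1", "compound", "GENE_A", 2.0),
    ("well_B2", "compound", "GENE_A", 2.0),
    ("well_A1", "vehicle", "GENE_B", 0.8),
    ("well_B1", "compound", "GENE_B", 0.4),
]
db = build_expression_db(example_records)
changes = fold_changes(db, "vehicle", "compound")
for gene, ratio in changes.items():
    print(gene, ratio)
```

Keeping one row per sample, treatment, and gene lets replicate wells be averaged in the query itself, so the same table can serve other comparisons (for example, different exposure times) without restructuring.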

Sources of Target Sequences

Target sequences for use in the methods of the present invention are
obtained from a number of sources. For example, the target sequences can be derived
from organisms or from cultured cell lines. Cell types utilized in the present invention
can be either prokaryotic or eukaryotic cell types and/or organisms, including, but not
limited to, animal cells, plants, yeast, fungi, bacteria, viruses, and the like. Target
sequences can also be obtained from other sources, for example, needle aspirants or tissue
samples from an organism (including, but not limited to, mammals such as mice, rodents,
guinea pigs, rabbits, dogs, cats, primates and humans; or non-mammalian animals such as
nematodes, frogs, amphibians, various fishes such as the zebra fish, and other species of
scientific interest), non-viable organic samples or their derivatives (such as a cell extract
or a purified biological sample), or environmental sources, such as an air or water sample.
Furthermore, target sequences can also be commercially or synthetically prepared, such
as a chemical, phage, or plasmid library. DNA and/or RNA sequences are available from
a number of commercial sources, including The Midland Certified Reagent Company
(mcrc@oligos.com), The Great American Gene Company (http://www.genco.com),
ExpressGen Inc. (www.expressgen.com), Operon Technologies Inc. (Alameda, CA) and
many others.

Cell lines which can be used in the methods of the present invention
include, but are not limited to, those available from cell repositories such as the American
Type Culture Collection (www.atcc.org), the World Data Center on Microorganisms
(http://wdcm.nig.ac.jp), European Collection of Animal Cell Culture (www.ecacc.org)
and the Japanese Cancer Research Resources Bank (http://cellbank.nihs.go.jp). These
cell lines include, but are not limited to, the following cell lines: 293, 293Tet-Off,
CHO-AA8 Tet-Off, MCF7, MCF7 Tet-Off, LNCap, T-5, BSC-1, BHK-21, Phinx-A, 3T3,
HeLa, PC3, DU145, ZR 75-1, HS 578-T, DBT, Bos, CV1, L-2, RK13, HTTA, HepG2,
BHK, Jurkat, Daudi, RAMOS, KG-1, K562, U937, HSB-2, HL-60, MDAHB231, C2C12,
HTB-26, HTB-129, HPIC5, A-431, CRL-1573, 3T3L1, Cama-1, J774A.1, HeLa 229,
PT-67, Cos7, OST7, HeLa-S, THP-1, and NXA. Additional cell lines for use in the methods
and matrices of the present invention can be obtained, for example, from cell line
providers such as Clonetics Corporation (Walkersville, MD; www.clonetics.com).
Optionally, the plurality of target sequences are derived from cultured cells optimized for
the analysis of a particular disease area of interest, e.g., cancer, inflammation,
cardiovascular disease, diabetes, infectious diseases, proliferative diseases, an immune
system disorder, or a central nervous system disorder.

A variety of cell culture media are described in The Handbook of
Microbiological Media, Atlas and Parks (eds) (1993, CRC Press, Boca Raton, FL).
References describing the techniques involved in bacterial and animal cell culture include
Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3 (1989,
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York); Current Protocols in
Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, (a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., supplemented through
2000); Freshney, Culture of Animal Cells, a Manual of Basic Technique, third edition
(1994, Wiley-Liss, New York) and the references cited therein; Humason, Animal Tissue
Techniques, fourth edition (1979, W.H. Freeman and Company, New York); and
Ricciardelli, et al. (1989) In Vitro Cell Dev. Biol. 25: 1016-1024. Information regarding
plant cell culture can be found in Plant Cell and Tissue Culture in Liquid Systems, by
Payne et al. (1992, John Wiley & Sons, Inc. New York, NY); Plant Cell, Tissue and Organ
Culture: Fundamental Methods by Gamborg and Phillips, eds. (1995, Springer Lab
Manual, Springer-Verlag, Berlin), and is also available in commercial literature such as
the Life Science Research Cell Culture Catalogue (1998) from Sigma-Aldrich, Inc (St
Louis, MO) (Sigma-LSRCCC) and the Plant Culture Catalogue and supplement (1997)
also from Sigma-Aldrich, Inc (St Louis, MO) (Sigma-PCCS).

In an exemplary embodiment of methods of the present invention, either
primary or immortalized (or other) cell lines are grown in a master flask, then trypsinized
(if they are adherent) and transferred to a 96-well plate, seeding each well at a density of
10* to 10* cells/well. If the gene expression profile in response to a chemical treatment is
sought, the chemical agent of choice is prepared in a range of concentrations. After a
time of recovery and growth as appropriate to the cell line, cells are exposed to the
chemical for a period of time that will not adversely impact the viability of the cells.
Preferably, assays include a range of chemical concentrations and exposure times, and
would include replicate samples. After treatment, medium is removed and cells are
immediately lysed.

In further embodiments of cell culture, formats other than a 96-well plate
may be used. Other multiwell or microplate formats containing various numbers of wells,
such as 6, 12, 48, 384, 1536 wells, or greater, are also contemplated. Culture formats that
do not use conventional flasks, as well as microtiter formats, may also be used.

Treatment of Cells

The cell lines or sources containing the target nucleic acid sequences are
optionally subjected to one or more specific treatments, or in the case of organisms, may
already be in different pathological or physiological stages that induce changes in gene
expression. For example, a cell or cell line can be treated with or exposed to one or more
chemical or biochemical constituents, e.g., pharmaceuticals, pollutants, DNA damaging
agents, oxidative stress-inducing agents, pH-altering agents, membrane-disrupting agents,
metabolic blocking agents, chemical inhibitors, cell surface receptor ligands, antibodies,
transcription promoters/enhancers/inhibitors, translation promoters/enhancers/inhibitors,
protein-stabilizing or destabilizing agents, various toxins, carcinogens or teratogens,
characterized or uncharacterized chemical libraries, proteins, lipids, or nucleic acids.
Optionally, the treatment comprises an environmental stress, such as a change in one or
more environmental parameters including, but not limited to, temperature (e.g., heat shock
or cold shock), humidity, oxygen concentration (e.g., hypoxia), radiation exposure,
culture medium composition, or growth saturation. Alternatively, cultured cells may be
exposed to other viable organisms, such as pathogens or other cells, to study changes in
gene expression that result from biological events, such as infections or cell-cell
interactions. Responses to these treatments may be followed temporally, and the
treatment can be imposed for various times and at various concentrations. Target
sequences can also be derived from cells or organisms exposed to multiple specific
treatments as described above, either concurrently or in tandem (e.g., a cancerous tissue
sample may be further exposed to a DNA damaging agent while grown in an altered
medium composition).

RNA Isolation 

In some embodiments of the present invention, total RNA is isolated from
samples for use as target sequences. Cellular samples are lysed once culture with or
without the treatment is complete by, for example, removing growth medium and adding
a guanidinium-based lysis buffer containing several components to stabilize the RNA. In
some embodiments of the present invention, the lysis buffer also contains purified RNAs
as controls to monitor recovery and stability of RNA from cell cultures. Examples of
such purified RNA templates include the Kanamycin Positive Control RNA from
Promega (Madison, WI), and 7.5 kb Poly(A)-Tailed RNA from Life Technologies
(Rockville, MD). Lysates may be used immediately or stored frozen at, e.g., -80°C.

Optionally, total RNA is purified from cell lysates (or other types of
samples) using silica-based isolation in an automation-compatible, 96-well format, such
as the RNeasy® purification platform (Qiagen, Inc.; Valencia, CA). Alternatively, RNA
is isolated using solid-phase oligo-dT capture using oligo-dT bound to microbeads or
cellulose columns. This method has the added advantage of isolating mRNA from
genomic DNA and total RNA, and allowing transfer of the mRNA-capture medium
directly into the reverse transcriptase reaction. Other RNA isolation methods are
contemplated, such as extraction with silica-coated beads or guanidinium. Further
methods for RNA isolation and preparation can be devised by one skilled in the art.

Alternatively, the methods of the present invention are performed using
crude cell lysates, eliminating the need to isolate RNA. RNAse inhibitors are optionally
added to the crude samples. When using crude cellular lysates, genomic DNA could
contribute one or more copies of target sequence, depending on the sample. In situations
in which the target sequence is derived from one or more highly expressed genes, the
signal arising from genomic DNA may not be significant. But for genes expressed at
very low levels, the background can be eliminated by treating the samples with DNAse,
or by using primers that target splice junctions. For example, one of the two target-specific
primers could be designed to span a splice junction, thus excluding DNA as a
template. As another example, the two target-specific primers are designed to flank a
splice junction, generating larger PCR products for DNA or unspliced mRNA templates
as compared to processed mRNA templates. One skilled in the art could design a variety
of specialized priming applications that would facilitate use of crude extracts as samples
for the purposes of this invention.
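
The size difference produced by primers that flank a splice junction can be made concrete with a small Python sketch. The coordinates are hypothetical, 0-based, half-open positions on a genomic sequence, and the function name and interface are assumptions introduced only for illustration. The product from genomic DNA (or unspliced RNA) retains any intron lying between the primers, whereas the product from processed mRNA does not.

```python
def product_lengths(fwd_start, rev_end, introns):
    """Return (spliced_length, unspliced_length) for one primer pair.

    fwd_start: genomic position of the 5' end of the forward primer.
    rev_end:   genomic position just past the 5' end of the reverse primer.
    introns:   list of (start, end) genomic intervals; only introns lying
               entirely between the primers shorten the spliced product.
    """
    if rev_end <= fwd_start:
        raise ValueError("reverse primer must lie downstream of forward primer")
    unspliced = rev_end - fwd_start
    removed = 0
    for start, end in introns:
        if fwd_start <= start and end <= rev_end:
            removed += end - start
    return unspliced - removed, unspliced


# Hypothetical gene: primers in adjacent exons flanking a 600-base intron.
spliced, unspliced = product_lengths(100, 1100, [(300, 900)])
print(spliced, unspliced)  # 400 1000
```

Because the two products differ in length, a length-based separation step can distinguish signal derived from processed mRNA from background derived from genomic DNA.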

Primer Design and Multiplex Strategies
Multiplex amplification of the target sequence involves combining the
plurality of target sequences with a plurality of target-specific primers and one or more
universal primers, to produce a plurality of amplification products. A multiplex set of
target sequences optionally comprises between about two targets and about 100 targets.
In one embodiment of the present invention, the multiplex reaction includes at least 5
target sequences, but preferably at least ten targets or at least fifteen targets. Multiplexes
of much larger numbers (e.g., about 20, about 50, about 75 and greater) are also
contemplated.

In one embodiment of the methods of the present invention, at least one of
the amplification targets in the multiplex set is a transcript that is endogenous to the
sample and has been independently shown to exhibit a fairly constant expression level
(for example, a "housekeeping" gene). The signal from this endogenous reference
sequence provides a control for converting signals of other gene targets into relative
expression levels. Optionally, a plurality of control mRNA targets/reference sequences
that have relatively constant expression levels may be included in the multiplexed
amplification to serve as controls for each other. Alternatively, a defined quantity of an
exogenous purified RNA species is added to the multiplex reaction or to the cells, for
example, with the lysis reagents. Almost any purified, intact RNA species can be used,
e.g., the Kanamycin Positive Control RNA or the 7.5 kb Poly(A)-Tailed RNA mentioned
previously. This exogenously-added amplification target provides a way to monitor the
recovery and stability of RNA from cell cultures. It can also serve as an exogenous
reference signal for converting the signals obtained from the sample mRNAs into relative
expression levels. In still another embodiment, a defined quantity of a purified DNA
species is added to the PCR to provide an exogenous reference target for converting the
signals obtained from sample mRNA targets into relative expression levels.
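
As a minimal sketch of converting raw signals into relative expression levels, the following Python function divides the signal of each target in a multiplex reaction by the signal of a designated reference target from the same reaction. The gene labels and signal values are hypothetical, and simple division by a single reference is an assumption made for illustration; other normalization schemes (for example, using several reference targets) are equally possible.

```python
def relative_expression(signals, reference):
    """Express each target's signal as a ratio to the reference signal.

    signals:   dict mapping target name to detected signal (arbitrary units)
               for the products of one multiplex reaction.
    reference: name of the endogenous or exogenous reference target.
    """
    if reference not in signals:
        raise KeyError(f"reference target {reference!r} not detected")
    ref_signal = signals[reference]
    if ref_signal <= 0:
        raise ValueError("reference signal must be positive")
    return {
        target: value / ref_signal
        for target, value in signals.items()
        if target != reference
    }


# Hypothetical peak areas from one multiplex reaction.
reaction = {"REFERENCE": 2000.0, "GENE_A": 500.0, "GENE_B": 4000.0}
levels = relative_expression(reaction, "REFERENCE")
print(levels)  # {'GENE_A': 0.25, 'GENE_B': 2.0}
```

Because the reference is amplified in the same tube as the targets, well-to-well differences in RNA recovery or amplification efficiency tend to cancel in the ratio.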

In one embodiment of the present invention, once the targets that comprise
a multiplex set are determined, primer pairs complementary to each target sequence are
designed, including both target-specific and universal primers. This can be accomplished
using any of several software products that design primer sequences, such as OLIGO
(Molecular Biology Insights, Inc., CO), Gene Runner (Hastings Software Inc., NY), or
Primer3 (The Whitehead Institute, MA). Figure 1 illustrates the elements of design of
exemplary target-specific primers (TSPs) and universal primers (UPs). Target-specific
primers (TSP1, TSP2, TSP3, TSP4 and TSP5) are comprised of at least two portions.
One portion, shown as a solid line within the 5' region of each of the five TSP sequences,
includes a region complementary to a selected "universal sequence." The universal
sequence is utilized to allow amplification of multiple targets (having divergent
sequences) while using the same primer (e.g., the UP). The universal sequence is
contained only in the primers, and preferably is not present in any nucleic acid (or
complement thereof) provided by the sample being tested. A second portion of the TSPs,
shown as variable lines (solid, dotted, dashed, etc.) within the 3' region of the sequence,
represents the sequence that is complementary to and will hybridize with one of a
plurality of designated target sequences. In Figure 1, a single universal primer (labeled as
"UP") is depicted; however, multiple universal primers having different or unique
sequences or labels can be employed in the methods of the present invention. Optionally,
the primer design also includes consideration of properties beyond the encoded sequence
of the primer, such as annealing temperature, 3'-end hybridization stability, and
minimization of sequences that would allow annealing among the primers themselves.
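
The two-part structure of a target-specific primer can be sketched in Python as follows. The sketch joins a universal tag at the 5' end to a target-specific region at the 3' end, and rejects a tag that occurs in the target or its complement. All sequences are hypothetical, the function names are introduced only for this example, and placing the universal primer sequence itself (rather than its complement) in the 5' portion is an assumption about orientation. Properties such as annealing temperature and primer-primer annealing, which dedicated design software evaluates, are not modeled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_tsp_pair(target, universal_fwd, universal_rev, specific_len=20):
    """Build forward and reverse target-specific primers, written 5' to 3'.

    Each primer is a universal tag (5' portion) followed by a region that
    hybridizes to one end of the target (3' portion).
    """
    target = target.upper()
    universal_fwd = universal_fwd.upper()
    universal_rev = universal_rev.upper()
    if len(target) < 2 * specific_len:
        raise ValueError("target is too short for the requested primer length")
    for tag in (universal_fwd, universal_rev):
        # The universal sequence should not occur in the sample nucleic acid.
        if tag in target or reverse_complement(tag) in target:
            raise ValueError("universal sequence occurs in the target")
    forward = universal_fwd + target[:specific_len]
    reverse = universal_rev + reverse_complement(target[-specific_len:])
    return forward, reverse


# Hypothetical 24-base target and 8-base tags; a 6-base specific region is
# used here only to keep the printed primers short.
fwd, rev = design_tsp_pair(
    "ATGGCCAAGTTTCCCGGGAAATAG", "CGCGTACG", "GCATTCGA", specific_len=6
)
print(fwd)  # CGCGTACGATGGCC
print(rev)  # GCATTCGACTATTT
```

Once products carrying the tags at both ends have been generated in early cycles, a primer matching a tag can amplify every target in the multiplex, which is the role the text assigns to the universal primer.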

Oligonucleotide primers are typically prepared by the phosphoramidite
approach. In this automated, solid-phase procedure, each nucleotide is individually added
to the 5'-end of the growing oligonucleotide chain, which is in turn attached at the 3'-end
to a solid support. The added nucleotides are in the form of trivalent 3'-phosphoramidites
that are protected from polymerization by a dimethoxytrityl ("DMT") group at the
5'-position. After base induced phosphoramidite coupling, mild oxidation to give a
pentavalent phosphotriester intermediate and DMT removal provides a new site for
oligonucleotide elongation. These syntheses may be performed on, for example, a Perkin
Elmer/Applied Biosystems Division DNA synthesizer. The oligonucleotide primers are
then cleaved off the solid support, and the phosphodiester and exocyclic amino groups are
deprotected with ammonium hydroxide.

Nucleic Acid Hybridization 

The length of complementary sequence between each primer and its
binding partner (i.e., the target sequence or the universal sequence) should be sufficient to
allow hybridization of the primer only to its target within a complex sample at the
annealing temperature used for the PCR. A complementary sequence of, for example,
about 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 or more nucleotides is preferred for both
the target-specific and universal regions of the primers. A particularly preferred length of
each complementary region is about 20 bases, which will promote formation of stable and
specific hybrids between the primer and target.

Nucleic acids "hybridize" when they associate, typically in solution.
Nucleic acids hybridize due to a variety of well characterized physico-chemical forces,
such as hydrogen bonding, solvent exclusion, base stacking and the like. An extensive
guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid
Probes, part I, chapter 2, "Overview of principles of hybridization and the strategy of
nucleic acid probe assays," (Elsevier, New York), as well as in Ausubel, supra. Hames
and Higgins (1995) Gene Probes 1, IRL Press at Oxford University Press, Oxford,
England (Hames and Higgins 1) and Hames and Higgins (1995) Gene Probes 2, IRL
Press at Oxford University Press, Oxford, England (Hames and Higgins 2) provide details
on the synthesis, labeling, detection and quantification of DNA and RNA, including
oligonucleotides.

"Stringent hybridization wash conditions" in the context of nucleic acid
hybridization experiments, such as Southern and northern hybridizations, are sequence
dependent, and are different under different environmental parameters. An extensive
guide to the hybridization of nucleic acids is found in Tijssen (1993), supra, and in
Hames and Higgins 1 and Hames and Higgins 2, supra.

For purposes of the present invention, generally, "highly stringent"
hybridization and wash conditions are selected to be about 5° C or less lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH (as
noted below, highly stringent conditions can also be referred to in comparative terms).
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the test
sequence hybridizes to a perfectly matched primer. Very stringent conditions are selected
to be equal to the Tm for a particular primer.

The Tm of a nucleic acid duplex indicates the temperature at which the 
duplex is 50% denatured under the given conditions, and it represents a direct measure 
of the stability of the nucleic acid hybrid. Thus, the Tm corresponds to the midpoint of 
the transition from helix to random coil; it depends on length, nucleotide composition, 
and ionic strength for long stretches of nucleotides. 
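
By way of illustration only, the dependence of Tm on length, nucleotide composition and 
ionic strength can be sketched with two widely used textbook approximations: the Wallace 
rule for short oligonucleotides and a salt-adjusted formula for longer duplexes. Neither 
formula is taken from the present disclosure; the function names and the default sodium 
concentration are assumptions chosen for the example, and empirical determination 
remains the reference. 

```python
import math


def wallace_tm(seq: str) -> float:
    """Wallace rule, Tm = 2*(A+T) + 4*(G+C) in deg C; a rough guide for short oligos."""
    s = seq.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2.0 * at + 4.0 * gc


def salt_adjusted_tm(seq: str, na_molar: float = 0.05) -> float:
    """Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N in deg C.

    na_molar is the monovalent cation concentration in mol/L (assumed default).
    """
    s = seq.upper()
    n = len(s)
    gc_percent = 100.0 * (s.count("G") + s.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 675.0 / n
```

Under these approximations, raising the salt concentration or the GC content raises the 
estimated Tm, which is consistent with lower salt giving higher stringency. 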

After hybridization, unhybridized nucleic acid material can be removed by 
a series of washes, the stringency of which can be adjusted depending upon the desired 
results. Low stringency washing conditions (e.g., using higher salt and lower 
temperature) increase sensitivity, but can produce nonspecific hybridization signals and 
high background signals. Higher stringency conditions (e.g., using lower salt and higher 
temperature that is closer to the hybridization temperature) lower the background signal, 
typically with only the specific signal remaining. See Rapley, R. and Walker, J.M., eds., 
Molecular Biomethods Handbook (Humana Press, Inc. 1998) (hereinafter "Rapley and 
Walker"), which is incorporated herein by reference in its entirety for all purposes. 

Thus, one measure of stringent hybridization is the ability of the primer to 
hybridize to one or more of the target nucleic acids (or complementary polynucleotide 
sequences thereof) under highly stringent conditions. Stringent hybridization and wash 
conditions can easily be determined empirically for any test nucleic acid. 

For example, in determining highly stringent hybridization and wash 
conditions, the hybridization and wash conditions are gradually increased (e.g., by 
increasing temperature, decreasing salt concentration, increasing detergent concentration 
and/or increasing the concentration of organic solvents, such as formalin, in the 
hybridization or wash), until a selected set of criteria are met. For example, the 
hybridization and wash conditions are gradually increased until a target nucleic acid, and 
complementary polynucleotide sequences thereof, binds to a perfectly matched 
complementary nucleic acid. 

A target nucleic acid is said to specifically hybridize to a primer nucleic 
acid when it hybridizes at least 1/2 as well to the primer as to a perfectly matched 
complementary target, i.e., with a signal to noise ratio at least 1/2 as high as hybridization 
of the primer to the target under conditions in which the perfectly matched primer binds 
to the perfectly matched complementary target with a signal to noise ratio that is at least 
about 2.5x-10x, typically 5x-10x as high as that observed for hybridization to any of the 
unmatched target nucleic acids. 
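
The two-part criterion just stated can be expressed, purely as an illustrative sketch, in 
a few lines of Python. The function name, argument names and the default fold threshold 
of 2.5 (the low end of the stated range) are assumptions introduced for the example. 

```python
def is_specific_hybridization(test_sn: float,
                              matched_sn: float,
                              unmatched_sn: float,
                              min_fold: float = 2.5) -> bool:
    """Apply the specific-hybridization criterion to measured signal-to-noise ratios.

    test_sn      -- S/N for the test target hybridized to the primer
    matched_sn   -- S/N for the perfectly matched complementary target
    unmatched_sn -- highest S/N observed for any unmatched target
    min_fold     -- required fold excess of matched over unmatched (assumed 2.5)
    """
    # The conditions qualify only if the perfect match stands out from mismatches.
    conditions_ok = matched_sn >= min_fold * unmatched_sn
    # The test target must hybridize at least half as well as the perfect match.
    return conditions_ok and test_sn >= 0.5 * matched_sn
```

For example, with a matched ratio of 10 and an unmatched ratio of 2, a test target giving 
a ratio of 6 would qualify, whereas one giving a ratio of 4 would not. 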

Optionally, primers are designed such that the annealing temperature of 
the universal sequence is higher than that of the target-specific sequences. 
Methods employing these primers further include increasing the annealing temperature of 
the reaction after the first few rounds of amplification. This increase in reaction 
temperature suppresses further amplification of sample nucleic acids by the TSPs, and 
drives amplification by the UP. Depending on the application envisioned, one skilled in 
the art can employ varying conditions of hybridization to achieve varying degrees of 
selectivity of primer towards the target sequence. For example, varying the stringency of 
hybridization or the position of primer hybridization can reveal divergence within gene 
families. 
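
As a minimal sketch of the temperature step described above, the following Python 
function lays out a per-cycle annealing schedule in which the early cycles use the lower 
temperature suited to the target-specific sequences and the remaining cycles use the 
higher temperature suited to the universal sequence. The function name, the cycle 
counts and the temperatures in the usage note are hypothetical values, not conditions 
taken from this disclosure. 

```python
from typing import List


def annealing_schedule(total_cycles: int,
                       early_cycles: int,
                       tsp_anneal_c: float,
                       up_anneal_c: float) -> List[float]:
    """Return the annealing temperature (deg C) for each cycle, in order."""
    if not 0 <= early_cycles <= total_cycles:
        raise ValueError("early_cycles must lie between 0 and total_cycles")
    if up_anneal_c <= tsp_anneal_c:
        raise ValueError("universal annealing temperature must exceed the TSP value")
    late_cycles = total_cycles - early_cycles
    # Early cycles favor TSP priming; later cycles favor the universal primer.
    return [tsp_anneal_c] * early_cycles + [up_anneal_c] * late_cycles
```

Calling annealing_schedule(30, 2, 55.0, 65.0), for instance, yields two cycles at 55 
followed by twenty-eight cycles at 65. 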

Optionally, each candidate primer is shown or proven to be compatible 
with the other primers used in a multiplex reaction. In a preferred embodiment, each 
target-specific primer pair produces a single amplification product of a predicted size 
from a sample minimally containing all of the targets of the multiplex, and more 
preferably from a crude RNA mixture. Preferably, amplification of each individual target 
by its corresponding primers is not inhibited by inclusion of any other primers in the 
multiplex. None of the primers, either individually or in combination, should produce 
spurious products. These issues are easily addressed by one of skill in the art without the 
need for undue experimentation. 

Inherent Properties and Labels 

Primer sequences are optionally designed to accommodate one or more 
detection techniques that can be employed while performing the methods of the present 
invention. For example, detection of the amplification products is optionally based upon 
one or more inherent properties of the amplification products themselves, such as mass or 
mobility. Other embodiments utilize methods of detection based on monitoring a label 
associated with the PCR products. In these embodiments, generally one or more of the 
universal primers contains the label. Optionally, the label is a fluorescent chromophore. 
A fluorescent label may be covalently attached, noncovalently intercalated, or may be an 
energy transfer label. Other useful labels include mass labels, which are incorporated into 
amplification products and released after the reaction for detection, chemiluminescent 

WO 01/55454 PCT/US01/02757 

labels, electrochemical and infrared labels, isotopic derivatives, nanocrystals, or any of 
various enzyme-linked or substrate-linked labels detected by the appropriate enzymatic 
reaction. 

One preferred embodiment of the methods of the present invention 
includes the use and detection of one or more fluorescent labels. Generally, fluorescent 
molecules each display a distinct emission spectrum, thereby allowing one to employ a 
plurality of fluorescent labels in a multiplexed reaction, and then separate the mixed data 
into its component signals by spectral deconvolution. Exemplary fluorescent labels for 
use in the methods of the present invention include a single dye covalently attached to the 
molecule being detected, a single dye noncovalently intercalated into product DNA, or an 
energy-transfer fluorescent label. 

Other embodiments of labeling include mass labels, which are 
incorporated into amplification products and released after the reaction for detection; 
chemiluminescent, electrochemical, and infrared labels; radioactive isotopes; and any of 
various enzyme-linked or substrate-linked labels detectable by the appropriate enzymatic 
reaction. Many other useful labels are known in the art, and one skilled in the art can 
envision additional strategies for labeling amplification products of the present invention. 

Cleavable Linkages and Size-Shifting of Amplification Products 
Primers can also be designed to produce amplification products having 
sizes which can selectively be changed, or "shifted," after amplification, in order to better 
resolve the amplification products prior to or during detection. For example, a primer can 
be designed to incorporate a restriction enzyme site within a portion of the amplified 
product. The products of this reaction can then optionally be cleaved enzymatically to 
generate size-shifted amplification products. Alternatively, primers can be designed to 
incorporate various chemically-cleavable linkages, mass labels, or other linkers which can 
optionally be used in the detection of one or more of the amplification products. 

Linking groups, or linkers, can also be incorporated into the primers of the 
present invention. Linking groups of use in the present invention can have a range of 
structures, substituents and substitution patterns. They can, for example, be derivatized 
with nitrogen, oxygen and/or sulfur containing groups which are pendent from, or integral 
to, the linker group backbone. Examples include polyethers, polyacids (polyacrylic acid, 
polylactic acid), polyols (e.g., glycerol), polyamines (e.g., spermine, spermidine) and 
molecules having more than one nitrogen, oxygen and/or sulfur moiety (e.g., 
1,3-diamino-2-propanol, taurine). See, for example, Sandler et al., Organic Functional 
Group Preparations, 2nd Ed., Academic Press, Inc., San Diego, 1983. 

Methods for preparing linkers that can be incorporated into primers for use 
in the methods of the present invention are known in the art. Numerous linking groups 
compatible with phosphoramidite chemistry are commercially available (Glen Research, 
Sterling, VA) and can readily be incorporated into oligonucleotides during automated 
synthesis procedures. 

One of skill will recognize that a linker that is appropriate for 
incorporation into a nucleic acid oligomer synthesis can also be utilized to derivatize a 
nucleic acid monomer. For example, chemically cleavable primers can be used in the 
amplification step of the methods of the present invention. In these embodiments, one or 
more of the primers used in amplification contain a chemical linkage, such as a 
thiophosphate moiety, that can be selectively cleaved, generating two separate fragments 
from the primer. Cleavage is optionally performed after the amplification reaction, e.g., 
by removing a fixed number of nucleotides from the 5' end of products made from that 
primer. Design and use of such primers is described in detail in, for example, Li et al. 
(Electrophoresis (1999) 20:1258-1265), PCT publication WO 96/37630 (Monforte et al.) 
and U.S. Patent Nos. 5,700,642 (Monforte et al.) and 6,090,558 (Butler et al.), which are 
incorporated herein by reference in their entirety for all purposes. 

Exemplary Primer Designs for Use in a Multiplexed Amplification 
Reaction 

A preferred embodiment of the invention utilizes a combination of TSPs 
that will hybridize with one of a plurality of designated target sequences, and universal 
primers (UPs) for amplification of multiple targets in the multiplexed reaction. 
Optionally, the primary way of separating the signals of the multiplexed amplification is 
according to product sizes. Alternatively, the signals can be resolved using differential 
labeling to separate signals from products of similar size. To separate products according 
to size, the predicted sizes must be considered in primer design. Figure 1 illustrates the 
elements of design of these primers. Each of the TSPs has a universal sequence within 
the 5' region, which is shared among the primers, but not contained in the original 
template (i.e., the target sequence). This universal sequence may be the same or different 
for the forward and reverse TSPs. Following the 3' end of the universal sequence is a 
target-specific sequence for annealing to and amplifying the target sequence (e.g., gene) 
of interest. 

The universal primer is composed of the universal sequence held in 
common within the 5' regions of the TSPs. If a single UP is to be used, the universal 
sequence will be the same within all TSPs. If a UP pair is to be used, the universal 
sequence will be different in the forward and reverse primers of the TSPs. The UP may 
also contain a detectable label on at least one of the primers, such as a fluorescent 
chromophore. Both the target-specific and universal sequences are of sufficient length 
and sequence complexity to form stable and specific duplexes, allowing amplification and 
detection of the target gene. 
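
The primer layout described above (a universal sequence in the 5' region followed by a 
target-specific sequence, with the universal sequence absent from the original template) 
can be sketched as follows. This is an illustrative example only; the helper names and 
the short sequences in the usage note are hypothetical and are not primers of the 
invention. 

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def build_tsp(universal_seq: str, target_specific_seq: str) -> str:
    """Assemble a TSP 5' to 3': universal sequence first, then target-specific."""
    return universal_seq.upper() + target_specific_seq.upper()


def universal_absent_from_template(universal_seq: str, template: str) -> bool:
    """Check that the universal sequence occurs on neither strand of the template."""
    u = universal_seq.upper()
    t = template.upper()
    return u not in t and u not in reverse_complement(t)
```

Here build_tsp("ACGT", "GGCC") gives "ACGTGGCC", and the universal primer is simply 
the shared 5' portion, "ACGT". A real design would also screen length, sequence 
complexity and duplex stability, which this sketch does not attempt. 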

Elimination of Variations in Primer Annealing Efficiency 

Variations in primer length and sequence can also have a large impact on 
the efficiency with which primers anneal to their target and prime replication. In a typical 
multiplexed reaction in which each product is amplified by a unique primer pair, the 
relative quantities of amplified products may be significantly altered from the relative 
quantities of targets due to differences in annealing efficiencies. Embodiments of the 
methods of the present invention that couple the use of target-specific primers and 
universal primers eliminate this bias, producing amplification products that accurately 
reflect relative mRNA levels. 

COUPLED TARGET-SPECIFIC AND UNIVERSAL PRIMING OF THE PCR 

In the methods of the present invention, the amounts of each designated 
target are amplified to improve the sensitivity and dynamic range of the assay. In some 
embodiments to monitor gene expression, cellular RNA is isolated and reverse 
transcribed to obtain cDNA, which is then used as template for amplification. In other 
embodiments, cDNA may be provided and used directly. The primers described for use 
in the present invention can be used in any one of a number of template-dependent 
processes that amplify sequences of the target gene and/or its expressed transcripts 
present in a given sample. Other types of templates may also be used, such as tRNA, 
rRNA, or other transcription products, genomic DNA, viral nucleic acids, and synthetic 
nucleic acid polymers. Several methods described below are contemplated. 

A preferred embodiment of the methods of the present invention employs 
PCR, which is described in detail in U.S. Patent Nos. 4,683,195 (Mullis et al.), 4,683,202 
(Mullis), and 4,800,159 (Mullis et al.), and in PCR Protocols: A Guide to Methods and 
Applications (Innis et al., eds.) Academic Press Inc., San Diego, CA (1990). PCR utilizes 
pairs of primers having sequences complementary to opposite strands of target nucleic 
acids, and positioned such that the primers are converging. The primers are incubated 
with template DNA under conditions that permit selective hybridization. Primers may be 
provided in double-stranded or single-stranded form, although the single-stranded form is 
preferred. If the target gene(s) sequence is present in a sample, the primers will hybridize 
to form a nucleic-acid:primer complex. An excess of deoxynucleoside triphosphates is 
added, along with a thermostable DNA polymerase, e.g., Taq polymerase. If the target 
gene(s):primer complex has been formed, the polymerase will extend the primer along the 
target gene(s) sequence by adding nucleotides. After polymerization, the newly 
synthesized strand of DNA is dissociated from its complementary template strand by 
raising the temperature of the reaction mixture. When the temperature is subsequently 
lowered, new primers will bind to each of these two strands of DNA, and the process is 
repeated. Multiple cycles of raising and lowering the temperature are conducted, with a 
round of replication in each cycle, until a sufficient amount of amplification product is 
produced. 
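
Because each cycle provides a round of replication, product accumulates roughly 
geometrically with cycle number. The following sketch uses the standard idealized 
relation N = N0 x (1 + E)^n, where E is a per-cycle efficiency between 0 and 1; the 
relation and the function name are illustrative assumptions and ignore the plateau seen 
in late cycles of real reactions. 

```python
def copies_after_cycles(initial_copies: float,
                        cycles: int,
                        efficiency: float = 1.0) -> float:
    """Idealized PCR yield: initial_copies * (1 + efficiency) ** cycles."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return initial_copies * (1.0 + efficiency) ** cycles
```

With perfect doubling, 100 starting copies become 102,400 copies after 10 cycles. 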

Figure 2 illustrates the TSP-UP coupled priming strategy. Heavier lines 
represent a DNA template; thinner lines depict the oligonucleotide primers. Primer 
nomenclature is as described in the legend to FIG. 1. The lower case "f" and "r" in the 
primer names indicate a forward or reverse orientation. Lines "A," "B," "C," and "D" 
represent unique nucleic acid sequences, and "A'," "B'," "C'," and "D'" indicate their 
respective complementary sequences. "B" and "C" sequences derive from the template; 
"A" and "D" sequences derive from universal primer sequences. Arrowheads indicate 
directionality. A vertical bar indicates an endpoint of the DNA strand. The first set of 
reactions (first arrow) occur in the early PCR cycles (for example, in only the first and 
second PCR cycles); in these reactions, primarily the TSPs are used as primers, and the 
resulting products will have UP sequences added to both ends, flanking the amplified 
target sequence. The second set of reactions (second, reiterative arrow) occur in all 
subsequent PCR cycles; both TSP and UP primers are used, but the UPs dominate when 
present in molar excess over the TSPs. 

In early rounds of the amplification, replication is primed primarily by the 
TSPs. The first round will add the universal sequence to the 5' regions of the 
amplification products. The second cycle will generate sequence complementary to the 
universal sequence within the 3' region of the complementary strand, creating a template 
that can be amplified by the universal primers alone. Optionally, the reaction is designed 
to contain limiting amounts of each of the TSPs and a molar excess of the UP, such that 
the UP will generally prime replication once its complementary sequence has been 
established in the template. The molar excess of UP over a TSP can range from about 5:1 
to about 100:1; optionally, the reaction utilizes approximately 10:1 molar excess of UP 
over the amount of each TSP. Because all of the TSPs contain the same universal 
sequence, the same universal primer will amplify all targets in the multiplex, eliminating 
the quantitative variation that results from amplification from different primers. 
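
The effect of handing amplification over to a shared universal primer can be seen in a 
toy model, offered only as a sketch under stated assumptions: each target is amplified 
with its own TSP efficiency for a small number of early cycles and with a common UP 
efficiency thereafter. The efficiencies, the choice of two early cycles and the function 
name are hypothetical. 

```python
def coupled_yield(initial_copies: float,
                  tsp_efficiency: float,
                  up_efficiency: float,
                  cycles: int,
                  early_cycles: int = 2) -> float:
    """Toy two-phase yield: TSP-primed early cycles, UP-primed later cycles."""
    if not 0 <= early_cycles <= cycles:
        raise ValueError("early_cycles must lie between 0 and cycles")
    late_cycles = cycles - early_cycles
    early_gain = (1.0 + tsp_efficiency) ** early_cycles  # target-dependent
    late_gain = (1.0 + up_efficiency) ** late_cycles     # shared by all targets
    return initial_copies * early_gain * late_gain


# Two equally abundant targets whose TSPs anneal with different efficiencies.
coupled_ratio = (coupled_yield(1.0, 0.9, 0.9, 30)
                 / coupled_yield(1.0, 0.6, 0.9, 30))
# The same targets if the TSPs primed every one of the 30 cycles.
uncoupled_ratio = (coupled_yield(1.0, 0.9, 0.9, 30, early_cycles=30)
                   / coupled_yield(1.0, 0.6, 0.9, 30, early_cycles=30))
```

In this model the distortion between the two targets is confined to the early cycles 
(a factor of about 1.4), whereas target-specific priming throughout would distort the 
ratio by more than a hundredfold. 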

Amplification Methods 

In a preferred embodiment of the methods of the present invention, RNA 
is converted to cDNA using a target-specific primer complementary to the RNA for each 
gene target being monitored in the multiplex set in a reverse-transcription (RT) reaction. 
Methods of reverse transcribing RNA into cDNA are well known, and described in 
Sambrook, supra. Alternative methods for reverse transcription utilize thermostable 
DNA polymerases, as described in the art. As an exemplary embodiment, avian 
myeloblastosis virus reverse transcriptase (AMV-RT), or Moloney murine leukemia virus 
reverse transcriptase (MoMLV-RT) is used, although other enzymes are contemplated. 
An advantage of using target-specific primers in the RT reaction is that only the desired 
sequences are converted into a PCR template. No superfluous primers or cDNA products 
are carried into the subsequent PCR amplification. 

In another embodiment of the amplifying step, RNA targets are reverse 
transcribed using non-specific primers, such as an anchored oligo-dT primer, or random 
sequence primers. An advantage of this embodiment is that the "unfractionated" quality 
of the mRNA sample is maintained because the sites of priming are non-specific, i.e., the 
products of this RT reaction will serve as template for any desired target in the 
subsequent PCR amplification. This allows samples to be archived in the form of DNA, 
which is more stable than RNA. 

In other embodiments of the methods of the present invention, 
transcription-based amplification systems (TAS) are used, such as that first described by 
Kwoh et al. (Proc. Natl. Acad. Sci. (1989) 86(4):1173-7), or isothermal 
transcription-based systems such as 3SR (Self-Sustained Sequence Replication; Guatelli 
et al. (1990) Proc. Natl. Acad. Sci. 87:1874-1878) or NASBA (nucleic acid sequence based 
amplification; Kievits et al. (1991) J Virol Methods 35(3):273-86). In these methods, the 
mRNA target of interest is copied into cDNA by a reverse transcriptase. The primer for 
cDNA synthesis includes the promoter sequence of a designated DNA-dependent RNA 
polymerase 5' to the primer's region of homology with the template. The resulting cDNA 
products can then serve as templates for multiple rounds of transcription by the 
appropriate RNA polymerase. Transcription of the cDNA template rapidly amplifies the 
signal from the original target mRNA. The isothermal reactions bypass the need for 
denaturing cDNA strands from their RNA templates by including RNase H to degrade 
RNA hybridized to DNA. 

In other embodiments, amplification is accomplished by use of the ligase 
chain reaction (LCR), disclosed in European Patent Application No. 320,308 (Backman 
and Wang), or by the ligase detection reaction (LDR), disclosed in U.S. Patent No. 
4,883,750 (Whiteley et al.). In LCR, two probe pairs are prepared, which are 
complementary to each other, and to adjacent sequences on both strands of the target. Each 
pair will bind to opposite strands of the target such that they abut. Each of the two probe 
pairs can then be linked to form a single unit, using a thermostable ligase. By 
temperature cycling, as in PCR, bound ligated units dissociate from the target, then both 
molecules can serve as "target sequences" for ligation of excess probe pairs, providing for 
an exponential amplification. The LDR is very similar to LCR. In this variation, 
oligonucleotides complementary to only one strand of the target are used, resulting in a 
linear amplification of ligation products, since only the original target DNA can serve as a 
hybridization template. It is used following a PCR amplification of the target in order to 
increase signal. 

In further embodiments, several methods generally known in the art would 
be suitable methods of amplification. Some additional examples include, but are not 
limited to, strand displacement amplification (Walker et al. (1992) Nucleic Acids Res. 
20:1691-1696), repair chain reaction (REF), cyclic probe reaction (REF), solid-phase 
amplification, including bridge amplification (Mehta and Singh (1999) BioTechniques 
26(6):1082-1086), rolling circle amplification (Kool, U.S. Patent No. 5,714,320), rapid 
amplification of cDNA ends (Frohman (1988) Proc. Natl. Acad. Sci. 85:8998-9002), and 
the "Invader assay" (Griffin et al. (1999) Proc. Natl. Acad. Sci. 96:6301-6306). 

Attenuation of Strong Signals 

The set of targets included in a multiplex reaction should generally all yield signal 
strengths within the dynamic range of the detection platform used in order for 
quantitation of gene expression to be accurate. In some embodiments, it may be desirable 
or necessary to include a very highly expressed gene in a multiplex assay. However, the 
highly-expressed gene can impact the accuracy of quantitation for other genes expressed 
at very low levels if its signal is not attenuated. The methods of the current invention 
provide ways for attenuating the signals of relatively abundant targets during the 
amplification reaction such that they can be included in a multiplexed set without 
impacting the accuracy of quantitation of that set. 

Toward this end, amplification primers are optionally used that block 
polymerase extension of the 3' end of the primer. One preferred embodiment is 
modification of the 3'-hydroxyl of the oligonucleotide primer by addition of a phosphate 
group. Another preferred embodiment is attachment of the terminal nucleotide via a 3'-3' 
linkage. One skilled in the art can conceive of other chemical structures or modifications 
that can be used for this purpose. The modified and the corresponding unmodified primer 
for the highly abundant target are mixed in a ratio empirically determined to reduce that 
target's signal, such that it falls within the dynamic range of other targets of the multiplex. 
Preferably, the reverse target-specific primer is modified, thereby attenuating signal by 
reduction of the amount of template created in the reverse transcriptase reaction. 

Another embodiment for signal attenuation entails use of a target-specific 
primer that contains the target-specific sequence, but no universal primer sequence. This 
abbreviated primer (sans universal sequence) and the corresponding primer containing 
the universal sequence within the 5' region are mixed in a ratio empirically determined to 
reduce that target's signal, such that it then falls within the dynamic range of other targets 
of the multiplex system. 
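
Although the mixing ratio is determined empirically, a first estimate can be sketched by 
assuming, for illustration only, that signal scales in proportion to the fraction of 
primer that remains productive (unblocked, or carrying the universal sequence). That 
proportionality and the function names below are assumptions, not statements of the 
disclosure. 

```python
def attenuating_primer_fraction(observed_signal: float,
                                desired_signal: float) -> float:
    """Fraction of primer to supply in attenuating form as a starting estimate.

    Assumes signal is proportional to the productive primer fraction.
    """
    if observed_signal <= 0 or desired_signal <= 0:
        raise ValueError("signals must be positive")
    if desired_signal >= observed_signal:
        return 0.0  # no attenuation needed
    return 1.0 - desired_signal / observed_signal


def attenuating_to_productive_ratio(fraction: float) -> float:
    """Convert an attenuating fraction in [0, 1) to a mixing ratio."""
    if not 0.0 <= fraction < 1.0:
        raise ValueError("fraction must be at least 0 and less than 1")
    return fraction / (1.0 - fraction)
```

For a target whose signal must be reduced tenfold, this estimate suggests about 90% 
attenuating primer, i.e., a ratio near 9:1, to be refined by experiment. 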

MULTIPLEX AMPLIFICATION STRATEGIES 

An important embodiment of the methods of the present invention 
involves the use of various PCR multiplexing strategies that are made possible by the 
combined use of target-specific and universal primers. An illustration of the fundamental 
multiplexed reaction is shown in Figure 3. 

The numbers 1 through 6 on the left represent six different reactions 
occurring simultaneously in a single mixture. Column A represents the six target 
sequences of the multiplex. Column B depicts the templates and primers in the PCR 
amplification. Lines shown as parallel and having opposite directionality represent 
complementary sequences. The templates are initially single-stranded mRNA molecules, 
but eventually are predominantly DNA amplification products that serve as template in 
subsequent cycles. Messenger RNA is converted to cDNA by the action of reverse 
transcriptase polymerization from the target-specific reverse primers (TSPr1-6) for each 
of the six targets. The six target-specific forward primers (TSPf1-6) and the universal 
forward and reverse primers (UPf1-6, UPr1-6) are added along with a thermostable 
polymerase to generate the second strand of cDNA, followed by PCR amplification. The 
drawings in Column B show single-stranded templates with the TSPs aligned (depicted as 
parallel) at their sites of hybridization. The UP can anneal to target DNA only after its 
complementary universal sequence is added to the opposite strand through replication 
across the 5' region of the TSP. Column C shows the products of PCR amplification. 
Products contain the target sequences (TS1-6) that were the targets of amplification, 
flanked by the universal primer sequences (UP) that were added to the ends of the target 
sequences by the target-specific primers. The TSPf and TSPr primers are specific, so by 
definition they will all be unique. However, the two universal primers may be the same 
sequence as each other or different sequences, i.e., the UPf may be the same sequence as 
the UPr. Furthermore, subsets of target sequences in the multiplex set may be amplified 
by different UPs, i.e., the UPf1-6 primers and/or UPr1-6 primers may be of one or 
multiple sequences. 

All of these examples are variations on the fundamental RT-PCR assay 
shown in Figure 3. For the sake of simplicity, only strategies using fluorescent dyes are 
illustrated, although many of the other labeling strategies previously discussed could be 
applied. 

DATA COLLECTION 

The number of species that can be detected within a mixture depends 
primarily on the resolution capabilities of the separation platform used, and the detection 
methodology employed. A preferred embodiment of the separation step of the methods 
of the present invention is based upon size-based separation technologies. Once 
separated, individual species are detected and quantitated by either inherent physical 
characteristics of the molecules themselves, or detection of a label associated with the 
DNA. 

Embodiments employing other separation methods are also described. For 
example, certain types of labels allow resolution of two species of the same mass through 
deconvolution of the data. Non-size based differentiation methods (such as 
deconvolution of data from overlapping signals generated by two different fluorophores) 
allow pooling of a plurality of multiplexed reactions to further increase throughput. 
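
Deconvolution of overlapping signals from two fluorophores can be illustrated, as a 
minimal sketch, by linear unmixing: if each dye contributes a known fraction of its 
signal to each of two detection channels, the dye amounts follow from solving a 2 x 2 
linear system. The linear mixing model, the function name and the crosstalk 
coefficients in the usage note are assumptions for the example; practical instruments 
typically use calibrated spectra and more channels. 

```python
from typing import Sequence, Tuple


def unmix_two_dyes(signal: Sequence[float],
                   spectra: Sequence[Sequence[float]]) -> Tuple[float, float]:
    """Recover two dye amounts from two channel readings by Cramer's rule.

    signal  -- (channel_1, channel_2) measured intensities
    spectra -- ((a11, a12), (a21, a22)), where a_ij is the signal that one
               unit of dye j produces in channel i (assumed known by calibration)
    """
    (a11, a12), (a21, a22) = spectra
    s1, s2 = signal
    det = a11 * a22 - a12 * a21
    if abs(det) < 1e-12:
        raise ValueError("dye spectra are not distinguishable in these channels")
    dye1 = (s1 * a22 - a12 * s2) / det
    dye2 = (a11 * s2 - a21 * s1) / det
    return dye1, dye2
```

For instance, with crosstalk coefficients ((1.0, 0.2), (0.1, 1.0)) and channel readings 
(4.0, 5.3), the recovered dye amounts are approximately 3 and 5. 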

Optionally, the throughput rate for the detection step is between about 100 
and 5000 samples per hour, preferably between about 250 and 2500 samples, and more 
preferably about 1000 samples per hour per separation system (i.e., one mass 
spectrometer, one lane of a gel, or one capillary of a capillary electrophoresis device). In 
order to further reduce assay costs and increase the throughput of the overall process, 
sample-handling is optionally conducted in a miniaturized format. For the methods of the 
present invention, miniaturized formats are those conducted at submicroliter volumes, 
including both microfluidic and nanofluidic platforms. Any or all of the amplification, 
separation, and/or detection steps of the present invention can utilize miniaturized 
formats and platforms. For example, many of the modes of separation described below 
are presently available in a miniaturized scale. 

Separation Methods 

Preferred embodiments of the present invention incorporate a step of 
separating the products of a reaction based on their size differences. The PCR products 
generated during the multiplex amplification optionally range from about 50 to about 500 
bases in length, which can be resolved from one another by size. Any one of several 
devices may be used for size separation, including mass spectrometry, any of several 
electrophoretic devices, including capillary, polyacrylamide gel, or agarose gel 
electrophoresis, or any of several chromatographic devices, including column 
chromatography, HPLC, or FPLC. 
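
When products are to be resolved by size, the predicted amplicon lengths of a multiplex 
can be screened in advance. The following sketch checks that all predicted sizes fall 
within the stated range of about 50 to about 500 bases and differ from one another by at 
least a chosen minimum. The function name and the minimum gap are hypothetical; the 
achievable resolution depends on the separation platform actually used. 

```python
from typing import Iterable


def sizes_resolvable(sizes: Iterable[int],
                     min_gap: int,
                     smallest: int = 50,
                     largest: int = 500) -> bool:
    """Return True if every size is in range and neighbors differ by >= min_gap."""
    ordered = sorted(sizes)
    in_range = all(smallest <= s <= largest for s in ordered)
    # Compare each product with the next larger one.
    spaced = all(b - a >= min_gap for a, b in zip(ordered, ordered[1:]))
    return in_range and spaced
```

A set of predicted sizes such as 100, 110 and 125 bases passes with a minimum gap of 5 
bases, whereas 100 and 102 bases would be flagged for redesign. 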

One preferred embodiment for sample analysis is mass spectrometry. 
Several modes of separation that determine mass are possible, including Time-of-Flight 
(TOF), Fourier Transform Mass Spectrometry (FTMS), and quadrupole mass 
spectrometry. Possible methods of ionization include Matrix-Assisted Laser Desorption 
and Ionization (MALDI) or Electrospray Ionization (ESI). A preferred embodiment for 
the uses described in this invention is MALDI-TOF (Wu, et al. (1993) Rapid 
Communications in Mass Spectrometry 7:142-146). This method may be used to 
provide unfragmented mass spectra of mixed-base oligonucleotides containing between 
about 1 and about 1000 bases. In preparing the sample for analysis, the analyte is mixed 
into a matrix of molecules that resonantly absorb light at a specified wavelength. Pulsed 
laser light is then used to desorb oligonucleotide molecules out of the absorbing solid 
matrix, creating free, charged oligomers and minimizing fragmentation. The preferred 
solid matrix material for this purpose is 3-hydroxypicolinic acid (Wu, supra), although 
others are contemplated. 

In another preferred embodiment, the device of the invention is a 
microcapillary for analysis of nucleic acids obtained from the sample. Microcapillary 
electrophoresis generally involves the use of a thin capillary or channel, which may 
optionally be filled with a particular medium to improve separation, and employs an 
electric field to separate components of the mixture as the sample travels through the 
capillary. Samples composed of linear polymers of a fixed charge-to-mass ratio, such as 
DNA, will separate based on size. The high surface to volume ratio of these capillaries 
allows application of very high electric fields across the capillary without substantial 
thermal variation, consequently allowing very rapid separations. When combined with 
confocal imaging methods, these methods provide sensitivity in the range of attomoles, 
comparable to the sensitivity of radioactive sequencing methods. The use of 
microcapillary electrophoresis in size separation of nucleic acids has been reported in 
Woolley and Mathies (Proc. Natl. Acad. Sci. USA (1994) 91:11348-11352). 

Capillaries are optionally fabricated from fused silica, or etched, 
machined, or molded into planar substrates. In many microcapillary electrophoresis 
methods, the capillaries are filled with an appropriate separation/sieving matrix. Several 
sieving matrices are known in the art that may be used for this application, including, e.g., 
hydroxyethyl cellulose, polyacrylamide, agarose, and the like. Generally, the specific gel 
matrix, running buffers and running conditions are selected to obtain the separation 
required for a particular application. Factors that are considered include, e.g., sizes of the 
nucleic acid fragments, level of resolution, or the presence of undenatured nucleic acid 
molecules. For example, running buffers may include agents such as urea to denature 
double-stranded nucleic acids in a sample. 

Microfluidic systems for separating molecules such as DNA and RNA are 
commercially available and are optionally employed in the methods of the present 
invention. For example, the "Personal Laboratory System" and the "High Throughput 
System" have been developed by Caliper Technologies, Corp. (Mountain View, CA). 
The Agilent 2100, which uses Caliper Technologies' LabChip™ microfluidic systems, is 
available from Agilent Technologies (Palo Alto, CA). Currently, specialized microfluidic 
devices which provide for rapid separation and analysis of both DNA and RNA are 
available from Caliper Technologies for the Agilent 2100. See, e.g., 
http://www.calipertech.com. 

Other embodiments are generally known in the art for separating PCR
amplification products by electrophoresis through gel matrices. Examples include
polyacrylamide, agarose-acrylamide, or agarose gel electrophoresis, using standard
methods (Sambrook, supra).

Alternatively, chromatographic techniques may be employed for resolving
amplification products. Many types of physical or chemical characteristics may be used
to effect chromatographic separation in the present invention, including adsorption,
partitioning (such as reverse phase), ion-exchange, and size exclusion. Many specialized
techniques have been developed for their application including methods utilizing liquid
chromatography or HPLC (Katz and Dong (1990) BioTechniques 8(5):546-55; Gaus et al.
(1993) J. Immunol. Methods 158:229-236).

In yet another embodiment of the separation step of the present invention,
cDNA products are captured by their affinity for certain substrates, or other incorporated
binding properties. For example, cDNA products labeled with biotin or an antigen can be
captured with beads bearing avidin or antibody, respectively. Affinity capture is utilized
on a solid support to enable physical separation. Many types of solid supports are known
in the art that would be applicable to the present invention. Examples include beads (e.g.,
solid, porous, magnetic), surfaces (e.g., plates, dishes, wells, flasks, dipsticks,
membranes), or chromatographic materials (e.g., fibers, gels, screens).

Certain separation embodiments entail the use of microfluidic techniques.
Technologies include separation on a microcapillary platform, such as designed by
ACLARA Biosciences Inc. (Mountain View, CA), or the LabChip™ microfluidic
devices made by Caliper Technologies Inc. Another recent technology developed by
Nanogen, Inc. (San Diego, CA), utilizes microelectronics to move and concentrate
biological molecules on a semiconductor microchip. The microfluidics platforms
developed at Orchid Biosciences, Inc. (Princeton, NJ), including the Chemtel™ Chip
which provides for parallel processing of hundreds of reactions, can be used in the present
invention. These microfluidic platforms require only nanoliter sample volumes, in
contrast to the microliter volumes required by other conventional separation technologies.

Fabrication of microfluidic devices, including microcapillary
electrophoretic devices, has been discussed in detail in, e.g., Regnier et al. (Trends
Biotechnol. (1999) 17(3):101-6), Deyl et al. (Forensic Sci. Int. (1998) 92:89-124),
Effenhauser et al. (Electrophoresis (1997) 18:2203-2213), and U.S. Patent No. 5,904,824
(Oh). Typically, the methods make use of photolithographic etching of micron-scale
channels on a silica, silicon, or other crystalline substrate or chip. In some embodiments,
capillary arrays may be fabricated using polymeric materials with injection-molding
techniques. These methods can be readily adapted for use in miniaturized devices of the
present invention.

Some of the processes usually involved in genetic analysis have been
miniaturized using microfluidic devices. For example, PCT publication WO 94/05414
reports an integrated micro-PCR apparatus for collection and amplification of nucleic
acids from a specimen. U.S. Patent Nos. 5,304,487 (Wilding et al.) and 5,296,375
(Kricka et al.) discuss devices for collection and analysis of cell-containing samples. U.S.
Patent No. 5,856,174 (Lipshutz et al.) describes an apparatus that combines the various
processing and analytical operations involved in nucleic acid analysis.
Additional technologies are also contemplated. For example, Kasianowicz
et al. (Proc. Natl. Acad. Sci. USA (1996) 93:13770-13773) describe the use of ion
channel pores in a lipid bilayer membrane for determining the length of polynucleotides.
In this system, an electric field is generated by the passage of ions through the pores.
Polynucleotide lengths are measured as a transient decrease of ionic current due to
blockage of ions passing through the pores by the nucleic acid. The duration of the
current decrease was shown to be proportional to polymer length. Such a system can be
applied as a size separation platform in the present invention.
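
The proportionality between blockade duration and polymer length can be illustrated with
a short calculation. The following Python sketch is not taken from the cited reference; it
assumes a simple calibration in which duration = k x length, fits k by least squares from
standards of known length, and then estimates the length of an unknown. The function
names and all numerical values are hypothetical.

```python
def fit_proportionality(durations_us, lengths_nt):
    """Least-squares slope k for the model duration = k * length (through the origin)."""
    numerator = sum(d * n for d, n in zip(durations_us, lengths_nt))
    denominator = sum(n * n for n in lengths_nt)
    return numerator / denominator


def estimate_length(duration_us, k):
    """Estimate polymer length (nucleotides) from one blockade duration."""
    return duration_us / k


# Hypothetical calibration standards: known lengths and measured blockade durations.
calibration_lengths_nt = [100, 150, 200]
calibration_durations_us = [200.0, 300.0, 400.0]

k = fit_proportionality(calibration_durations_us, calibration_lengths_nt)
print(estimate_length(250.0, k))
```

In practice many blockade events would be averaged per species before the length is
estimated; the single-event call above is only for illustration.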

The target-specific primers and universal primers of the present invention
are useful both as reagents for hybridization in solution, such as priming PCR
amplification, as well as for embodiments employing a solid phase, such as microarrays.
With microarrays, sample nucleic acids such as mRNA or DNA are fixed on a selected
matrix or surface. PCR products may be attached to the solid surface via one of the
amplification primers, then denatured to provide single-stranded DNA. This
spatially-partitioned, single-stranded nucleic acid is then subject to hybridization with selected
probes under conditions that allow a quantitative determination of target abundance. In
this embodiment, amplification products from each individual multiplexed reaction are
not physically separated, but are differentiated by hybridizing with a set of probes that are
differentially labeled. Alternatively, unextended amplification primers may be physically
immobilized at discrete positions on the solid support, then hybridized with the products
of a multiplexed PCR amplification for quantitation of distinct species within the sample.
In this embodiment, amplification products are separated by way of hybridization with
probes that are spatially separated on the solid support.

Separation platforms may optionally be coupled to utilize two different
separation methodologies, thereby increasing the multiplexing capacity of reactions
beyond that which can be obtained by separation in a single dimension. For example,
some of the RT-PCR primers of a multiplex reaction may be coupled with a moiety that
allows affinity capture, while other primers remain unmodified. Samples are then passed
through an affinity chromatography column to separate PCR products arising from these
two classes of primers. Flow-through fractions are collected and the bound fraction
eluted. Each fraction may then be further separated based on other criteria, such as size,
to identify individual components.
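
The gain in multiplexing capacity from a second separation dimension can be sketched as
follows. This Python example is illustrative only: the product names, sizes, and the
boolean capture-tag flag are hypothetical. It shows that products of identical size remain
distinguishable once they are first split into bound and flow-through fractions.

```python
from collections import defaultdict


def fractionate(products):
    """Split (name, size_bp, has_capture_tag) products into two affinity fractions."""
    fractions = defaultdict(list)
    for name, size_bp, has_capture_tag in products:
        key = "bound" if has_capture_tag else "flow-through"
        fractions[key].append((size_bp, name))
    # Sorting by size mimics the second (size-based) separation within each fraction.
    return {key: sorted(items) for key, items in fractions.items()}


def is_resolved(fractions):
    """True if no two products within the same fraction share a size."""
    for items in fractions.values():
        sizes = [size for size, _ in items]
        if len(sizes) != len(set(sizes)):
            return False
    return True


# Hypothetical multiplex: geneA/geneB and geneC/geneD co-migrate by size alone.
products = [
    ("geneA", 150, True),
    ("geneB", 150, False),
    ("geneC", 200, True),
    ("geneD", 200, False),
]

fractions = fractionate(products)
print(fractions)
print(is_resolved(fractions))
```

With a single size dimension, only two of these four products could be told apart; with
the affinity step added, each (fraction, size) pair identifies one product.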

The invention also includes a rapid analytical method using one or more
microfluidic handling systems. For example, a subset of primers in a multiplex reaction
would contain a hydrophobic group. Separation is then performed in two dimensions,
with hydrophilic partitioning in one direction, followed by size separation in the second
direction. The use of a combination of dyes can further increase the multiplex size.

Detection Methods 

Following separation of the different products of the multiplex, one or
more of the member species is detected and/or quantitated. Some embodiments of the
methods of the present invention enable direct detection of products. Other embodiments
detect reaction products via a label associated with one or more of the amplification
primers. Many types of labels suitable for use in the present invention are known in the
art, including chemiluminescent, isotopic, fluorescent, electrochemical, infrared, or mass
labels, or enzyme tags. In further embodiments, separation and detection may be a
multi-step process in which samples are fractionated according to more than one property of the
products, and detected at one or more stages during the separation process.

One embodiment of the invention requiring no labeling or modification of
the molecules being analyzed is detection of the mass-to-charge ratio of the molecule
itself. This detection technique is optionally used when the separation platform is a mass
spectrometer. An embodiment for increasing resolution and throughput with mass
detection is in mass-modifying the amplification products. Nucleic acids can be
mass-modified through either the amplification primer or the chain-elongating nucleoside
triphosphates. Alternatively, the product mass can be shifted without modification of the
individual nucleic acid components, by instead varying the number of bases in the
primers. Several types of moieties have been shown to be compatible with analysis by
mass spectrometry, including polyethylene glycol, halogens, alkyl, aryl, or aralkyl
moieties, and peptides (described in, for example, U.S. Patent No. 5,691,141). Isotopic
variants of specified atoms, such as radioisotopes or stable, higher mass isotopes, are also
used to vary the mass of the amplification product. Radioisotopes can be detected based
on the energy released when they decay, and numerous applications of their use are
generally known in the art. Stable (non-decaying) heavy isotopes can be detected based
on the resulting shift in mass, and are useful for distinguishing between two amplification
products that would otherwise have similar or equal masses. Other embodiments of
detection that make use of inherent properties of the molecule being analyzed include
ultraviolet light absorption (UV) or electrochemical detection. Electrochemical detection
is based on oxidation or reduction of a chemical compound to which a voltage has been
applied. Electrons are either donated (oxidation) or accepted (reduction), which can be
monitored as current. For both UV absorption and electrochemical detection, sensitivity
for each individual nucleotide varies depending on the component base, but with
molecules of sufficient length this bias is insignificant, and detection levels can be taken
as a direct reflection of overall nucleic acid content.
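
The effect of varying the number of bases on product mass can be estimated with a short
calculation. The Python sketch below is an approximation under stated assumptions, not a
method from this disclosure: it uses commonly quoted average residue masses for
single-stranded DNA and a fixed terminal correction for an oligonucleotide without a 5'
phosphate. The sequences are hypothetical, and real spectra also depend on charge state
and adducts.

```python
# Approximate average masses (Da) of DNA nucleotide residues (assumed values).
RESIDUE_MASS_DA = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
# Approximate correction for the termini of an oligonucleotide lacking a 5' phosphate.
TERMINAL_CORRECTION_DA = -61.96


def oligo_mass(sequence):
    """Approximate average mass (Da) of a single-stranded DNA oligonucleotide."""
    return sum(RESIDUE_MASS_DA[base] for base in sequence.upper()) + TERMINAL_CORRECTION_DA


# Hypothetical example: extending a sequence by one T shifts the mass by one residue.
base_product = "ACGT"
extended_product = "ACGTT"
mass_shift = oligo_mass(extended_product) - oligo_mass(base_product)
print(round(oligo_mass(base_product), 2), round(mass_shift, 2))
```

Two products of otherwise equal mass could thus be separated by roughly 300 Da per added
base, without modifying any individual nucleotide.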

Several embodiments of the detecting step of the present invention are
designed to identify molecules indirectly by detection of an associated label. A number
of labels may be employed that provide a fluorescent signal for detection (see, for
example, www.probes.com). If a sufficient quantity of a given species is generated in a
reaction, and the mode of detection has sufficient sensitivity, then some fluorescent
molecules may be incorporated into one or more of the primers used for amplification,
generating a signal strength proportional to the concentration of DNA molecules. Several
fluorescent moieties, including Alexa 350, Alexa 430, AMCA, BODIPY 630/650,
BODIPY 650/665, BODIPY-FL, BODIPY-R6G, BODIPY-TMR, BODIPY-TRX,
carboxyfluorescein, Cascade Blue, Cy3, Cy5, Fluorescein, HEX, 6-JOE, Oregon
Green 488, Oregon Green 500, Oregon Green 514, Pacific Blue, REG, Rhodamine Green,
Rhodamine Red, ROX, TAMRA, TET, Tetramethylrhodamine, and Texas Red, are
generally known in the art and routinely used for identification of discrete nucleic acid
species, such as in sequencing reactions. Many of these dyes have emission spectra
distinct from one another, enabling deconvolution of data from incompletely resolved
samples into individual signals. This allows pooling of separate reactions that are each
labeled with a different dye, increasing the throughput during analysis, as described in
more detail below.
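
Deconvolution of overlapping dye signals can be sketched as a small linear unmixing
problem. The Python example below is a minimal illustration for two dyes read in two
detection channels; the crosstalk coefficients and channel readings are hypothetical, and
a real instrument would calibrate them from single-dye standards and typically use more
channels with a least-squares fit.

```python
def unmix_two_dyes(channel_1, channel_2, crosstalk):
    """Recover two dye contributions from two channel readings.

    crosstalk[i][j] is the relative signal that one unit of dye j produces in channel i.
    The 2x2 linear system is solved by Cramer's rule.
    """
    (m00, m01), (m10, m11) = crosstalk
    determinant = m00 * m11 - m01 * m10
    if abs(determinant) < 1e-12:
        raise ValueError("dye spectra are not distinguishable in these channels")
    dye_a = (channel_1 * m11 - m01 * channel_2) / determinant
    dye_b = (m00 * channel_2 - channel_1 * m10) / determinant
    return dye_a, dye_b


# Hypothetical calibration: dye B bleeds 20% into channel 1, dye A bleeds 10% into channel 2.
crosstalk = [[1.0, 0.2], [0.1, 1.0]]
# Hypothetical readings from a peak where two differently labeled products co-migrate.
dye_a, dye_b = unmix_two_dyes(11.0, 6.0, crosstalk)
print(dye_a, dye_b)
```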

The signal strength obtained from fluorescent dyes can be enhanced
through use of related compounds called energy transfer (ET) fluorescent dyes. After
absorbing light, ET dyes have emission spectra that allow them to serve as "donors" to a
secondary "acceptor" dye that will absorb the emitted light and emit a lower energy
fluorescent signal. Use of these coupled-dye systems can significantly amplify
fluorescent signal. Examples of ET dyes include the ABI PRISM BigDye terminators,
recently commercialized by Perkin-Elmer Corporation (Foster City, CA) for applications
in nucleic acid analysis. These chromophores incorporate the donor and acceptor dyes
into a single molecule: an energy transfer linker couples a donor fluorescein to a
dichlororhodamine acceptor dye, and the complex is attached to a DNA replication
primer.

Fluorescent signals can also be generated by non-covalent intercalation of
fluorescent dyes into nucleic acids after their synthesis and prior to separation. This type
of signal will vary in intensity as a function of the length of the species being detected,
and thus signal intensities must be normalized based on size. Several applicable dyes are
known in the art, including, but not limited to, ethidium bromide and Vistra Green. Some
intercalating dyes, such as YOYO or TOTO, bind so strongly that separate DNA
molecules can each be bound with a different dye and then pooled, and the dyes will not
exchange between DNA species. This enables mixing separately generated reactions in
order to increase multiplexing during analysis.
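
The size normalization required for intercalating dyes can be illustrated with a simple
calculation. This Python sketch assumes, as a simplification, that signal scales linearly
with product length, so that dividing intensity by length gives a value proportional to
the number of molecules. The peak names and values are hypothetical.

```python
def normalize_by_length(peaks):
    """Convert (name, length_bp, intensity) peaks into per-base-pair signal values."""
    normalized = {}
    for name, length_bp, intensity in peaks:
        if length_bp <= 0:
            raise ValueError(f"length must be positive for {name}")
        normalized[name] = intensity / length_bp
    return normalized


# Hypothetical peaks: the longer product is four times brighter only because it is
# four times longer; after normalization both products show the same molar signal.
peaks = [("short_product", 100, 500.0), ("long_product", 400, 2000.0)]
molar_signal = normalize_by_length(peaks)
print(molar_signal)
```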

Alternatively, technologies such as the use of nanocrystals as a fluorescent
DNA label (Alivisatos, et al. (1996) Nature 382:609-11) can be employed in the methods
of the present invention. Another method, described by Mazumder, et al. (Nucleic Acids
Res. (1998) 26:1996-2000), describes hybridization of a labeled oligonucleotide probe to
its target without physical separation from unhybridized probe. In this method, the probe
is labeled with a chemiluminescent molecule that in the unbound form is destroyed by
sodium sulfite treatment, but is protected in probes that have hybridized to target
sequence.

In another embodiment, products may be detected and quantitated by
monitoring a set of mass labels, each of which is specifically associated with one species
of amplification reaction. The labels are released by either chemical or enzymatic
mechanisms after the amplification reaction. Release is followed by size separation of the
mixture of labels to quantitate the amount of each species of the amplification reaction.
Separation methods that can be employed include mass spectrometry, capillary
electrophoresis, or HPLC. Such strategies, and their applications for detection of nucleic
acids, have been described in, for example, U.S. Patent Nos. 6,104,028 (Hunter et al.) and
6,051,378 (Monforte et al.), as well as PCT publications WO 98/26095 (Monforte et al.)
and WO 97/27327 (Van Ness et al.).

In further embodiments, both electrochemical and infrared methods of
detection can be amplified over the levels inherent to nucleic acid molecules through
attachment of EC or IR labels. Their characteristics and use as labels are described in, for
example, PCT publication WO 97/27327. Some preferred compounds that can serve as
an IR label include an aromatic nitrile, aromatic alkynes, or aromatic azides. Numerous
compounds can serve as an EC label; many are listed in PCT publication WO 97/27327.

Enzyme-linked reactions are also employed in the detecting step of the
methods of the present invention. Enzyme-linked reactions theoretically yield an infinite
signal, due to amplification of the signal by enzymatic activity. In this embodiment, an
enzyme is linked to a secondary group that has a strong binding affinity to the molecule
of interest. Following separation of the nucleic acid products, enzyme is bound via this
affinity interaction. Nucleic acids are then detected by a chemical reaction catalyzed by
the associated enzyme. Various coupling strategies are possible utilizing
well-characterized interactions generally known in the art, such as those between biotin and
avidin, an antibody and antigen, or a sugar and lectin. Various types of enzymes can be
employed, generating colorimetric, fluorescent, chemiluminescent, phosphorescent, or
other types of signals. As an illustration, a PCR primer may be synthesized containing a
biotin molecule. After PCR amplification, DNA products are separated by size, and those
made with the biotinylated primer are detected by binding with streptavidin that is
covalently coupled to an enzyme, such as alkaline phosphatase. A subsequent chemical
reaction is conducted, detecting bound enzyme by monitoring the reaction product. The
secondary affinity group may also be coupled to an enzymatic substrate, which is detected
by incubation with unbound enzyme. One of skill in the art can conceive of many
possible variations on the different embodiments of detection methods described above.

In some embodiments, it may be desirable prior to detection to separate a
subset of amplification products from other components in the reaction, including other
products. Exploitation of known high-affinity biological interactions can provide a
mechanism for physical capture. In some embodiments of this process, the 5' region of
one of the universal primers contains a binding moiety that allows capture of the products
of that primer. Some examples of high-affinity interactions include those between a
hormone and its receptor, a sugar and a lectin, avidin and biotin, or an antigen and its
antibody. After affinity capture, molecules are retrieved by cleavage, denaturation, or
elution with a competitor for binding, and then detected as usual by monitoring an
associated label. In some embodiments, the binding interaction providing for capture
may also serve as the mechanism of detection.

Furthermore, the size of an amplification product or products is
optionally changed, or "shifted," in order to better resolve the amplification products from
other products prior to detection. For example, chemically cleavable primers can be used
in the amplification reaction. In this embodiment, one or more of the primers used in
amplification contains a chemical linkage that can be broken, generating two separate
segments from the primer. Cleavage is performed after the amplification reaction,
removing a fixed number of nucleotides from the 5' end of products made from that
primer. Design and use of such primers is described in detail in, for example, PCT
publication WO 96/37630.
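
The size shift produced by a cleavable primer amounts to simple arithmetic, sketched
below in Python. The product lengths and the number of nucleotides removed are
hypothetical; the example only shows how two co-migrating products become resolvable
when one of them is made with a cleavable primer.

```python
def shifted_size(product_length_nt, cleaved_nt):
    """Length of a product strand after removing cleaved_nt nucleotides from its 5' end."""
    if not 0 <= cleaved_nt < product_length_nt:
        raise ValueError("cleaved length must be non-negative and shorter than the product")
    return product_length_nt - cleaved_nt


# Hypothetical case: two products are both 180 nt long and cannot be resolved by size.
product_x = 180
product_y = 180
# Product y is made with a cleavable primer whose linkage lies 12 nt from the 5' end.
product_y_shifted = shifted_size(product_y, 12)
print(product_x, product_y_shifted)
```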

One preferred embodiment of the methods of the present invention is the
generation of gene expression profiles. However, several other applications are also
possible, as would be apparent to one skilled in the art from a reading of this disclosure.
For example, the methods of the present invention can be used to investigate the profile
and expression levels of one or more members of complex gene families. As an
illustration, cytochrome P-450 isozymes form a complex set of closely related enzymes
that are involved in detoxification of foreign substances in the liver. The various
isozymes in this family have been shown to be specific for different substrates. Design of
target-specific primers that anneal to variant regions in the genes provides an assay by
which their relative levels of induction in response to drug treatments can be monitored.
Other examples include monitoring expression levels of alleles with allele-specific
primers, or monitoring mRNA processing with primers that specifically hybridize to a
spliced or unspliced region, or to splice variants. One skilled in the art could envision
other applications of the present invention that would provide a method to monitor
genetic variations or expression mechanisms.

SYSTEMS FOR GENE EXPRESSION ANALYSIS

The present invention also provides systems for analyzing gene
expression. The elements of the system include, but are not limited to, an amplification
module for producing a plurality of amplification products from a pool of target
sequences; a detection module for detecting one or more members of the plurality of
amplification products and generating a set of gene expression data; and an analyzing
module for organizing and/or analyzing the data points in the data set. Any or all of these
modules can comprise high throughput technologies and/or systems.

The amplification module of the system of the present invention produces
a plurality of amplification products from a pool of target sequences. The amplification
module includes at least one pair of universal primers and at least one pair of
target-specific primers for use in the amplification process. Optionally, the amplification
module includes a unique pair of universal primers for each target sequence.
Furthermore, the amplification module can include components to perform one or more
of the following reactions: a polymerase chain reaction, a transcription-based
amplification, a self-sustained sequence replication, a nucleic acid sequence based
amplification, a ligase chain reaction, a ligase detection reaction, a strand displacement
amplification, a repair chain reaction, a cyclic probe reaction, a rapid amplification of
cDNA ends, an invader assay, a bridge amplification, a rolling circle amplification,
solution phase and/or solid phase amplifications, and the like.

The detection module detects the presence, absence, or quantity of one or
more members of the plurality of amplification products. Additionally, the detection
module generates a set of gene expression data, generally in the form of a plurality of data
points. The detection module optionally further comprises a separation module for
separation of one or more members of the multiplexed reaction prior to, or during,
operation of the detection module. The detection module, or the optional separation
module, can include systems for implementing separation of the amplification products;
exemplary detection modules include, but are not limited to, mass spectrometry
instrumentation and electrophoretic devices.

The third component of the system of the present invention, the analyzing
module, is in operational communication with the detection module. The analyzing
module of the system includes, e.g., a computer or computer-readable medium having one
or more logical instructions for analyzing the plurality of data points
generated by the detection system. The analyzing system optionally comprises multiple
logical instructions; for example, the logical instructions can include one or more
instructions which organize the plurality of data points into a database and one or more
instructions which analyze the plurality of data points. The instructions can include
software for performing difference analysis upon the plurality of data points.
Additionally (or alternatively), the instructions can include or be embodied in software
for generating a graphical representation of the plurality of data points. Optionally, the
instructions can be embodied in system software which performs combinatorial analysis
on the plurality of data points.
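
One possible form of difference analysis on such data points is sketched below in Python.
This is an assumption-laden illustration rather than the software of the invention: it
normalizes each profile to a reference gene and reports the log2 ratio of treated to
control signal. The function names, gene names other than GAPDH, and signal values are
hypothetical.

```python
import math


def normalize_to_reference(profile, reference_gene):
    """Divide every signal in a profile by the reference gene signal."""
    reference_signal = profile[reference_gene]
    return {
        gene: signal / reference_signal
        for gene, signal in profile.items()
        if gene != reference_gene
    }


def difference_analysis(control, treated, reference_gene):
    """Return log2(treated / control) for each gene present in both profiles."""
    control_norm = normalize_to_reference(control, reference_gene)
    treated_norm = normalize_to_reference(treated, reference_gene)
    return {
        gene: math.log2(treated_norm[gene] / control_norm[gene])
        for gene in control_norm
        if gene in treated_norm
    }


# Hypothetical data points (arbitrary signal units) from two multiplexed reactions.
control = {"GAPDH": 1000.0, "geneA": 200.0, "geneB": 400.0}
treated = {"GAPDH": 500.0, "geneA": 800.0, "geneB": 200.0}

log2_changes = difference_analysis(control, treated, "GAPDH")
print(log2_changes)
```

Here geneA shows an eight-fold induction (log2 change of about 3), while geneB is
unchanged once the lower overall yield of the treated reaction is accounted for.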

The computer employed in the analyzing module of the present invention
can be, e.g., a PC (Intel x86 or Pentium chip-compatible DOS™, OS2™, WINDOWS™,
WINDOWS NT™, WINDOWS95™, WINDOWS98™, or WINDOWS ME™), a
LINUX based machine, a MACINTOSH™, Power PC, or a UNIX based machine (e.g.,
SUN™ work station) or other commercially common computer which is known to one of
skill. Software for computational analysis is available, or can easily be constructed by
one of skill using a standard programming language such as VisualBasic, Fortran, Basic,
C, C++, Java, or the like. Standard desktop applications such as word processing
software (e.g., Microsoft Word™ or Corel WordPerfect™) and database software (e.g.,
spreadsheet software such as Microsoft Excel™, Corel Quattro Pro™, or database
programs such as Microsoft Access™ or Paradox™) can also be used in the analyzing
system of the present invention.

The computer optionally includes a monitor that is often a cathode ray tube
("CRT") display, a flat panel display (e.g., active matrix liquid crystal display, liquid
crystal display), or others. Computer circuitry is often placed in a box that includes
numerous integrated circuit chips, such as a microprocessor, memory, interface circuits,
and others. The box also optionally includes a hard disk drive, a floppy disk drive, a high
capacity removable drive such as a writeable CD-ROM, and other common peripheral
elements. Inputting devices such as a keyboard or mouse optionally provide for input
from a user and for user selection of sequences to be compared or otherwise manipulated
in the relevant computer system.

The computer typically includes appropriate software for receiving user
instructions, either in the form of user input into a set of parameter fields, e.g., in a GUI, or
in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different
specific operations. The software then converts these instructions to appropriate language
for instructing the operation of the fluid direction and transport controller to carry out the
desired operation.

The software can also include output elements for displaying and/or
further analyzing raw data, massaged data, or proposed results from one or more
computational processes involved in the analysis of the gene expression data set.



In an additional aspect, the present invention provides kits embodying the
methods, compositions, and systems for analysis of gene expression as described herein.
Kits of the present invention optionally comprise one or more of the following, preferably
in a spatially separate arrangement: a) at least one pair of universal primers; b) at least
one pair of target-specific primers; c) at least one pair of reference gene-specific primers;
and d) one or more amplification reaction enzymes, reagents, or buffers. Optionally, the
universal primers provided in the kit include labeled primers, such as those described in
the present application and the references cited herein. The target-specific primers can
vary from kit to kit, depending upon the specified target gene(s) to be investigated.
Exemplary reference gene-specific primers (e.g., target-specific primers for directing
transcription of one or more reference genes) include, but are not limited to, primers for
β-actin, cyclophilin, GAPDH, and various rRNA molecules.

The kits of the invention optionally include one or more preselected primer
sets that are specific for the genes to be amplified. The preselected primer sets optionally
comprise one or more labeled nucleic acid primers, contained in suitable receptacles or
containers. Exemplary labels include, but are not limited to, a fluorophore, a dye, a
radiolabel, an enzyme tag, etc., that is linked to a nucleic acid primer itself.

In one embodiment, kits that are suitable for use in PCR are provided. In
PCR kits, target-specific and universal primers are provided which include
sequences from, and hybridize to, spatially distinct regions of one or more target
genes. Optionally, pairs of target-specific primers are provided. Generally, the
target-specific primers are composed of at least two parts: a universal sequence within the 5'
portion that is complementary to a universal primer sequence, and a sequence within the
3' portion (and optionally, proximal to the universal sequence) for recognition of a target
gene. In some embodiments of the invention, the set of targets monitored in an analysis
may be specified by a client for use in a proprietary testing or screening application. In
an alternate embodiment, standardized target sets may be developed for general
applications, and constitute components of the kits described below. Kits of either of
these embodiments can be used to amplify all genes, unknown and/or known, that respond
to certain treatments or stimuli.
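
The two-part structure of a target-specific primer can be illustrated with a short sketch.
The Python example below simply joins a universal sequence (5' portion) to a
target-recognition sequence (3' portion), both written 5' to 3'. The sequences and the
function name are hypothetical placeholders, and the sketch does not address strand
orientation, melting temperature, or specificity checks that real primer design requires.

```python
VALID_BASES = set("ACGT")


def build_target_specific_primer(universal_sequence, target_sequence):
    """Join a 5' universal portion and a 3' target-recognition portion (both 5'->3')."""
    universal = universal_sequence.upper()
    target = target_sequence.upper()
    for label, sequence in (("universal", universal), ("target", target)):
        if not sequence or not set(sequence) <= VALID_BASES:
            raise ValueError(f"{label} sequence must be non-empty and contain only A, C, G, T")
    return universal + target


# Hypothetical sequences, chosen only to show the layout of the chimeric primer.
universal_portion = "AGGTGACACTATAGAATA"
target_portion = "CTGGAGAAACCTGCCAAG"

primer = build_target_specific_primer(universal_portion, target_portion)
print(primer, len(primer))
```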

In addition, one or more materials and/or reagents required for preparing a
biological sample for gene expression analysis are optionally included in the kit.
Furthermore, optionally included in the kits are one or more enzymes suitable for
amplifying nucleic acids, including various polymerases (RT, Taq, etc.), one or more
deoxynucleotides, and buffers to provide the necessary reaction mixture for
amplification.

In one preferred embodiment of the invention, the kits are employed for
analyzing gene expression patterns using mRNA as the starting template. The mRNA
template may be presented as either total cellular RNA or isolated mRNA; both types of
sample yield comparable results. In other embodiments, the methods and kits described
in the present invention allow quantitation of other products of gene expression, including
tRNA, rRNA, or other transcription products. In still further embodiments, other types of
nucleic acids may serve as template in the assay, including genomic or extragenomic
DNA, viral RNA or DNA, or nucleic acid polymers generated by non-replicative or
artificial mechanisms, including PNA or RNA/DNA copolymers.

Optionally, the kits of the present invention further include software to
expedite the generation, analysis and/or storage of data, and to facilitate access to
databases. The software includes logical instructions, instruction sets, or suitable
computer programs that can be used in the collection, storage and/or analysis of the data.
Comparative and relational analysis of the data is possible using the software provided.

The kits optionally comprise distinct containers for each individual reagent
and enzyme, as well as for each probe or primer pair. Each component will generally be
suitable as aliquoted in its respective container. The container of the kits optionally
includes at least one vial, ampule, or test tube. Flasks, bottles and other container
mechanisms into which the reagents can be placed and/or aliquoted are also possible.
The individual containers of the kit are preferably maintained in close confinement for
commercial sale. Suitable larger containers may include injection or blow-molded plastic
containers in which the desired vials are retained. Instructions, such as written
directions or videotaped demonstrations detailing the use of the kits of the present
invention, are optionally provided with the kit.

In a further aspect, the present invention provides for the use of any
composition or kit herein, for the practice of any method or assay herein, and/or for the
use of any apparatus or kit to practice any assay or method herein.

EXAMPLES 

The methods of the present invention are particularly suited for analyzing
gene expression patterns. The present invention provides methods for the rapid
generation of a differential expression profile of a defined set of genes through
comparison of data from multiple reactions. Multiple differential expression profiles can
be used for comparison of different cell types, or of a single cell type exposed to different
environmental conditions, or in various developmental or disease states. The methods of
the present invention provide a way to generate large bodies of differential expression
data, which can be used for modeling a matrix of gene product interactions for whole
cells. Relational analysis is used with large and complex sets of gene expression profiles,
and is valuable for identification of potential therapeutic targets, screening of candidate
drugs, diagnostics, and other potential uses.

The methods of thie present invention can also be suitably modified for the 
analysis of other biological processes, including, but not limited to, genotyping, mapping, 

30 mutation analysis, forensics, or analysis of other RNA molecules such as tRNAs. iRNAs, 
orhnRNAs. 
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The following examples are included to demonstrate various embodiments 
of the present invention. It will be appreciated by those of skill in the art that the 
techniques disclosed in the examples which follow represent techniques determined by 
the inventor to function well in the practice of the invention, and thus can be considered 
5 to constitute pieferred modes for its practice. However, those of skill in the art should, in 
light of the present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without departing 
from the spirit and scope of the invention. 

Example 1: Cell Culture and Chemical Exposure 
The hepatocyte cell line Hep G2 (human hepatocellular carcinoma,
obtained from the American Type Culture Collection, Rockville MD, ATCC#HB-8065)
was used to evaluate the effects of various chemicals on expression of a set of genes
known to be involved in cellular toxicological responses. The cells were routinely
maintained in T75 flasks in Eagle's MEM medium (with non-essential amino acids,
sodium pyruvate, and Earle's salts) and 10% fetal bovine serum at 37°C in a humidified
atmosphere of 5% CO2. The chemicals used in exposure experiments included cadmium
chloride (CdCl2) and methyl methane sulfonate (MMS). CdCl2 is a strong inducer of
metallothionein, a metal-binding protein, and is known to be carcinogenic and capable of
interfering with DNA repair. MMS is an alkylating agent that induces DNA damage.
Dilutions of these compounds were prepared from concentrated stocks obtained from
Aldrich Chemical Company (Milwaukee, WI). Water was used as the solvent control in
dosing studies. Approximately 0.02 mL of a dilution of each toxin was added to 2 mL of
culture medium, with final concentrations ranging from 10^-4 M to 10^-6 M CdCl2 and from
0.5 mM to 2 mM MMS. These concentration ranges were empirically determined not to
be lethal to cells for the duration of the exposure period. To perform exposures, cells
were trypsinized and transferred to twelve-well dishes, seeding each well at a density of
1 x 10^4 cells/well. After 4 days of recovery and growth, cells were exposed to the
designated toxin for 3 hours. Medium was then removed and cells immediately lysed.
Cell number was quantitated using a dye incorporation assay, CyQUANT from Molecular
Probes (Eugene, OR).
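
The dosing arithmetic in Example 1 can be illustrated with a short calculation. This is only a sketch: it assumes the 0.02 mL of diluted toxin is added on top of the 2 mL of medium (2.02 mL total), and the function name is hypothetical rather than part of the described method.

```python
def required_dilution_conc(final_molar, spike_ml=0.02, medium_ml=2.0):
    """Concentration (M) the diluted toxin must have so that adding
    spike_ml of it to medium_ml of medium yields final_molar."""
    total_ml = spike_ml + medium_ml  # assumption: volumes are additive
    return final_molar * total_ml / spike_ml


# Final concentrations named in Example 1.
targets = [
    ("CdCl2, upper bound", 1e-4),
    ("CdCl2, lower bound", 1e-6),
    ("MMS, lower bound", 0.5e-3),
    ("MMS, upper bound", 2e-3),
]

for label, final in targets:
    needed = required_dilution_conc(final)
    print(f"{label}: final {final:.1e} M requires a {needed:.2e} M dilution")
```

Under these assumptions each dilution is about 101-fold more concentrated than the intended final concentration in the well.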

Example 2: RNA Isolation 

Total RNA was purified from crude cell lysates using RNeasy® total RNA
purification kits from Qiagen Inc. (Valencia, CA), in an automation-compatible, 96-well
format. In order to monitor recovery and stability of RNA from cell cultures, two
purified RNA samples (Kanamycin Positive Control RNA from Promega (Madison, WI),
and 7.5 kb Poly(A)-Tailed RNA from Life Technologies (Rockville, MD)) were added
with the lysis reagents. After the cellular treatments were complete, growth medium was
removed and cells were lysed under denaturing conditions with RLT buffer (Qiagen,
Valencia, CA) containing guanidine isothiocyanate and beta-mercaptoethanol to
inactivate RNases. Ethanol was then added to promote binding of RNA to the RNeasy
membrane, and the entire volumes of the samples were loaded into the wells of a
multiwell plate. The silica gel membrane of the RNeasy kit specifically binds total RNA,
allowing contaminants to be washed away in flow-through processing of the membrane
using a vacuum manifold. Samples bound to the membrane were dried by centrifugation
of the plate. In order to elute RNA, 45 µL of RNase-free water was added to each
sample well, incubated, and collected by centrifugation, and the elution process was then
repeated. Samples were stable in this form, and were stored at -80°C for later use in
expression assays.

Example 3: Reverse Transcription to Generate cDNA

A multiplex primer mix was designed to amplify ten target mRNAs,
including four controls and six test targets. Two of the controls were endogenous cellular
mRNAs that exhibit constant expression levels (β-actin and cyclophilin), allowing for
normalization of signals from other genes. Two additional control RNA targets were
added exogenously in the cell lysis buffer to provide a means to monitor recovery and
stability of RNA from cell lysates (kanamycin mRNA and the 7.5 kb RNA as previously
described). Six test genes were chosen that had been shown in prior art to exhibit
changes in the amount of mRNA transcribed from those genes in response to a specific
challenge.

Reverse transcription and PCR amplification primers were designed for
the gene multiplex set using OLIGO 5.0 (Molecular Biology Insights, Inc., Cascade, CO).
The sizes of the predicted PCR amplification products of the nine targets ranged from 100
to 330 bases, with the smallest size difference being 5 bases. The length of
complementary sequence between each target-specific primer and its target sequence was
20 bases, and the length of complementary sequence between the target-specific primers
and the universal primers was 18 bases. Primers were synthesized by Operon
Technologies Inc. (Alameda, CA), or by chemists at GeneTrace Systems Inc. (Alameda,
CA), utilizing conventional phosphoramidite synthesis techniques.

A mixture of reverse target-specific primers appropriate for the multiplex
was prepared and diluted to a working concentration of 0.02 µM. (Reverse priming of
β-actin mRNA is attenuated by addition of a second, inhibitory reverse target-specific
primer. See Example 4.) To begin the reverse transcription step, 30 ng of total RNA,
prepared as described in Example 2, was mixed with the reverse primers, 10 units of
Moloney Murine Leukemia Virus Reverse Transcriptase (MoMLV-RT, Promega Inc.),
and deoxyribonucleotides (1 mM from Promega) in an appropriate buffer (20 mM
Tris-HCl, 16.7 mM MgCl2, pH 8.3, and 2.5 units RNasin). Samples were incubated at
42°C for 30 minutes, followed by 95°C for 5 minutes to inactivate the enzyme.

Example 4: Signal Attenuation 

If one of the targets in a multiplex set is present at very high levels, it may
be necessary to attenuate the signal generated by that target to ensure that all signals fall
within the dynamic range of the assay. The β-actin mRNA provided one such example,
as this mRNA is constitutively expressed at very high levels. Amplification of the β-actin
signal was attenuated by using a mixture of two target-specific reverse primers, the first
terminating at the 3' end with a hydroxyl group which is extendible by a reverse
transcriptase, and the second containing a phosphate group attached to the 3'-hydroxyl
which blocks extension by reverse transcriptase. The blocked β-actin primer was used in
a 40-fold excess relative to the extendible primer, and the combined concentration was
equivalent to the concentrations of all other target-specific reverse primers in the
multiplex. This amount of inhibition typically resulted in about a 70% reduction in
conversion of mRNA to cDNA.
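
The primer split described in Example 4 can be worked through numerically. The sketch below assumes the combined β-actin reverse primer concentration equals the 0.02 µM working concentration given in Example 3; the function name is hypothetical.

```python
def split_primer(total_conc, blocked_excess):
    """Split a combined primer concentration into (extendible, blocked)
    parts, given the blocked:extendible molar ratio."""
    extendible = total_conc / (1.0 + blocked_excess)
    blocked = total_conc - extendible
    return extendible, blocked


TOTAL_UM = 0.02  # assumed: working concentration from Example 3, in µM

extendible, blocked = split_primer(TOTAL_UM, blocked_excess=40)
print(f"extendible primer: {extendible * 1000:.2f} nM")
print(f"blocked primer:    {blocked * 1000:.2f} nM")
print(f"extendible share:  {extendible / TOTAL_UM:.1%}")
```

Under this assumption the extendible primer makes up roughly 1/41 of the mixture. The text reports about a 70% reduction in cDNA conversion at this ratio, so the primer fraction should not be read as a direct prediction of signal; the ratio appears to be something tuned empirically.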

Example 5: Multiplex Amplification of Target Sequences using a Single,
Unlabeled Universal Primer

After inactivation of the reverse transcriptase, the cDNA products were
used directly as templates in a PCR amplification. A mixture of forward target-specific
primers appropriate for the multiplex reaction was prepared (SEQ ID No. 1-22). A single
unlabeled universal primer was used for amplification; both the forward and reverse
target-specific primers in the multiplex composition were designed to contain the same
universal sequence within their 5' regions. The forward target-specific primers and the
universal primer were diluted to a working concentration of 10 nM and 500 nM
respectively, and then added to the samples from the reverse transcriptase reaction, along
with 1 unit TaqGOLD® (Perkin-Elmer Applied Biosystems Inc., Foster City, CA) and
375 µM deoxyribonucleotides in a TaqGOLD-supplied buffer. The samples were
heated at 95°C for 10 minutes to activate the enzyme, then cycled at appropriate
temperatures and for the appropriate number of cycles to achieve amplification of the
designated target sequences, while remaining in the exponential phase of the reaction.
For example, the samples are amplified for between 30-45 cycles using the following
temperatures and times: 94°C for 30 sec, 55°C for 30 sec, and 68°C for 1 min. See Innis,
supra.

Example 6: Detection of Amplification Products by Mass Spectrometry

After PCR amplification, samples were ready for separation and analysis.
The method of ionization used for mass spectrometric analysis was Matrix-Assisted Laser
Desorption and Ionization (MALDI). Mass determinations were made by Time-of-Flight
(TOF). A desorption/ionization matrix for analyzing samples was composed of a 9:1
ratio of saturated hydroxypicolinic acid (HPA) to picolinic acid (PA) (Aldrich) in 25%
acetonitrile and 25 mM diammonium citrate. A mass spectrometer analysis plate was
spotted in 384 positions with aliquots of the matrix, which were then allowed to dry
and/or crystallize. A defined quantity of an oligonucleotide (e.g., 0.5 µl of a 5-10 nM
solution, depending on the mass of the oligonucleotide), having a mass within the range
of the amplification products, was added to each PCR reaction to serve as an internal
quantitation standard. An aliquot of approximately 0.5 - 1 µl of each sample was then
pipetted on top of each of the crystallized spots. Samples were allowed to dry again,
forming DNA:HPA co-crystals.

The sample plate was placed in the mass spectrometer load lock chamber,
pumped down to a low vacuum pressure, transferred to the sample chamber, then finally
pumped down further to the required operating vacuum pressure. The sample chamber
contains an X-Y table to orient the samples under the laser beam, and ion optics to
accelerate and direct DNA ions into the flight tube and towards the detector. Ionized
DNA fragments hitting the detector are assigned a mass based on the time required to
travel through the flight tube. Various parameters were set within the automated data
collection software to enable collection of signal in the appropriate mass range, and the
coordinate positions on the analysis plate for the samples to be examined were entered. A
laser beam of 355 nm light was focused through a window in the sample chamber onto
the sample being analyzed. The laser power was adjusted to maximize the signal-to-noise
ratio, while minimizing fragmentation of DNA in the sample. Data was collected
according to the set parameters, generating a signal spectrum for each sample. The data
was further processed using signal calling software proprietary to GeneTrace. The
software smoothed the spectra, identified signal peaks, assigned masses to the peaks, and
integrated the data to quantitate the relative amount of each species in the sample. These
values were then normalized to the internal quantitation standard to convert the data to
absolute values.

Data generated by the signal calling software was imported into Microsoft
Excel (Bellevue, WA). Signals from each of the gene products being quantitated were
normalized to the signal from the reference nucleic acid (the multiplex control target
taken to have a constant abundance level). When a second reference target was included
in the multiplex, this signal was also normalized to the first reference, and checked to
confirm that its abundance relative to the first reference was constant. Data was stored in
tabular form as normalized signal intensities.
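
The reference normalization described above was performed in a spreadsheet; the same arithmetic can be sketched in a few lines. Everything below is illustrative: the function names, the signal values, and the 20% tolerance used for the second-reference check are assumptions, not values taken from the examples.

```python
def normalize_to_reference(signals, reference):
    """Divide each target signal by the signal of the reference target."""
    ref = signals[reference]
    if ref <= 0:
        raise ValueError("reference signal must be positive")
    return {name: value / ref for name, value in signals.items()}


def second_reference_is_stable(samples, first_ref, second_ref, tolerance=0.2):
    """True if second_ref/first_ref stays within `tolerance` (as a fraction
    of the mean ratio) across all samples. The default is a placeholder."""
    ratios = [s[second_ref] / s[first_ref] for s in samples]
    mean = sum(ratios) / len(ratios)
    return all(abs(r - mean) <= tolerance * mean for r in ratios)


# Hypothetical integrated peak areas (arbitrary units) for two samples.
control = {"beta-actin": 1000.0, "cyclophilin": 500.0, "gene_A": 200.0}
treated = {"beta-actin": 800.0, "cyclophilin": 410.0, "gene_A": 480.0}

for label, sample in [("control", control), ("treated", treated)]:
    print(label, normalize_to_reference(sample, "beta-actin"))

print("second reference stable:",
      second_reference_is_stable([control, treated], "beta-actin", "cyclophilin"))
```

Expressing every signal as a ratio to the reference target removes sample-to-sample differences in input RNA and recovery, which is why the second reference is checked for a constant ratio before the normalized values are trusted.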

Additional details regarding analysis by mass spectroscopy are presented 
in further examples as detailed below. 

Example 7: Multiplex Amplification using a Single, Labeled Forward
Universal Primer and an Unlabeled Reverse Universal Primer

The cDNA products of another toxicology multiplex sample were used as
templates in a PCR amplification that generated labeled products. A mixture of forward
target-specific primers appropriate for the multiplex reaction was prepared. These
primers contained a different universal sequence within their 5' regions than that of the
reverse primers used to generate the cDNA. A forward universal primer was modified by
covalent attachment of a fluorescein moiety (FAM, available from Perkin-Elmer/Applied
Biosystems, Inc.), while the reverse universal primer remained unlabeled. The forward
target-specific primers and the universal primers were diluted to a working concentration
and then added to samples from the reverse transcriptase reaction, along with TaqGOLD
and deoxyribonucleotides in an appropriate buffer. The PCR amplification was carried
out as described in Example 5.






PCT/US01/02757

WO 01/55454 

Example 8: Generating a Pool of Two Multiplexed Amplifications - Using
a Single Forward Universal Primer Containing One of Two Labels and an
Unlabeled Reverse Universal Primer

The cDNA products of additional toxicology multiplex samples were used
as templates in two PCR amplifications to generate differently labeled products. A
mixture of forward target-specific primers appropriate for the multiplex reaction was
prepared. These primers contained a different universal sequence within their 5' regions
than that of the reverse primers used to generate the cDNA. In addition to the
fluorescein-modified primer described in Example 7, a second preparation of the forward
universal primer was made, modifying it by covalent attachment of a hexachlorofluorescein
moiety (HEX, Perkin-Elmer/Applied Biosystems, Inc.). The reverse universal primer remained
unlabeled. The forward target-specific primers and the universal primers were diluted to
a working concentration (of 10 nM and 500 nM respectively). Forward target-specific
primers, TaqGOLD and deoxyribonucleotides in an appropriate buffer were added to
samples from the reverse transcriptase reaction. The FAM-modified forward universal
primer was added to one of the PCR amplification reactions, and the HEX-modified
forward universal primer was added to the other. PCR amplification was carried out as
described in Example 5.

Example 9: Detection of Amplification Products by Polyacrylamide Gel
Electrophoresis

After PCR amplification using the fluorescently-labeled primers, the
multiplexed samples were ready for analysis by polyacrylamide gel electrophoresis. A
standard sequencing gel composed of 5% polyacrylamide, and containing 6M urea and
890 mM Tris-borate and 2 mM EDTA, was cast for use on an ABI PRISM 377 DNA
Sequencer (Perkin-Elmer/Applied Biosystems). Amplification products were diluted and
mixed with a solution of GeneScan 500 ROX-labeled size standards (PE Applied
Biosystems, CA) in formamide (1:5). Samples were loaded on the gel, and the
components of the multiplex reaction mixture were electrophoretically separated by size
according to standard conditions, for example, 1.5 hours running at 2000 V, 60 mA
current, 20 W power, gel temperature of 51°C, and laser power of 40 mW (ABI 377).
Fluorescent data was collected by laser scanning across the gel in real time. GeneScan™
software was used to quantitate fluorescent signals from the amplification products, and
Genotyper™ software (both from Perkin-Elmer/Applied Biosystems) was used for
subsequent calculations and data manipulations.

Example 10: Generating a Pool of Two Multiplexed Amplifications -
Using Two Forward Universal Primers of Different Lengths and with
Different Labels, and an Unlabeled Reverse Universal Primer
The cDNA products of other toxicology multiplex samples were used as
template in two PCR amplifications to generate equivalent amplification products of
slightly offset sizes, both labeled with the same chromophore. A mixture of forward
target-specific primers appropriate for the multiplex was prepared. These primers
contained a different universal sequence at their 5' ends than that of the reverse primers
used to generate the cDNA. Two forward universal primers were made with the same
universal sequence, but one contained three additional bases at its 5' end. One of the
forward universal primers was modified by covalent attachment of a FAM moiety, and
the other was modified by covalent attachment of a HEX moiety. The reverse universal
primer remained unlabeled. The forward target-specific primers and the universal
primers were diluted to a working concentration. Forward target-specific primers,
TaqGOLD and deoxyribonucleotides in an appropriate buffer were added to samples
from the reverse transcriptase reaction. One of the labeled forward universal primers was
added to each of the reactions. PCR amplification was carried out as described in
Example 5.

Example 11: Detection of Amplification Products by Denaturing Capillary
Electrophoresis

Two PCR multiplex samples were analyzed by capillary electrophoresis at
the end of the PCR amplification. The samples were combined and diluted 1:10 in CE
sample dilution buffer (1:5 dilution of fluorescently labeled ladder in deionized
formamide). The pooled sample was analyzed on an ABI PRISM 310 Genetic Analyzer,
with capillaries containing POP4 acrylamide matrix (PE Perkin-Elmer Applied
Biosystems, CA). Components of the pooled multiplexes were electrophoretically
separated by size according to standard conditions. Fluorescent data was collected at
wavelengths appropriate for the FAM and HEX labels. Sizes were assigned to each
signal peak based on their migration relative to the ROX size standards.
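
Assigning a size to a peak from its migration relative to size standards can be illustrated as below. This is a simplified stand-in: it uses piecewise-linear interpolation between the two flanking standard peaks, which is not necessarily the sizing method applied by the instrument software, and the migration times are invented for illustration.

```python
from bisect import bisect_right


def size_from_migration(t, standards):
    """Estimate fragment size (bases) at migration time t by linear
    interpolation between the two flanking size-standard peaks.
    `standards` is a list of (migration_time, size) sorted by time."""
    times = [m for m, _ in standards]
    if not times[0] <= t <= times[-1]:
        raise ValueError("peak lies outside the range of the size standards")
    # Index of the standard just above t (clamped so t == last time works).
    i = min(bisect_right(times, t), len(times) - 1)
    (t0, s0), (t1, s1) = standards[i - 1], standards[i]
    return s0 + (s1 - s0) * (t - t0) / (t1 - t0)


# Hypothetical migration times (scan numbers) for five standard peaks.
ladder = [(1000, 100), (1400, 150), (1800, 200), (2200, 250), (2600, 300)]

for peak_time in (1080, 1500, 2560):
    print(peak_time, "->", size_from_migration(peak_time, ladder), "bases")
```

Peaks that fall outside the ladder are rejected rather than extrapolated, since sizes estimated beyond the outermost standards are generally less reliable.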

Example 12: Data Analysis

The data collected from the FAM and HEX fluorescent signals were
analyzed using GeneScan analysis software. The fluorescent signals were deconvoluted
to yield information specific for each of the individual fluorophores in the mixture, to
generate a baseline, to sort the signals into "size bins" relative to the ROX size standards,
and to quantitate the amount of DNA represented in each bin. The results from this
analysis were further processed by Genotyper software (PE Applied Biosystems, CA) to
automate the repetitive tasks of data analysis. Sample files from GeneScan were
imported into Genotyper, which then assigned data to the size ranges programmed by the
operator. The data generated in this manner was stored in tabular form, and then
imported into Excel. The signals from each of the gene products being quantitated were
normalized to the signal generated by the internal reference (the multiplex control target
taken to have a constant abundance level). When a second internal reference target was
included in the multiplex, this signal was also normalized to the first reference, and
checked to confirm that its abundance relative to the first reference was constant. Data
was stored in tabular form as normalized signal intensities.

Example 13: Multiplex Analysis of Cellular Transcription in PC-3 Cells
after Treatment with a Battery of Compounds

Preparation of Target Sequences 
PC-3, a human prostate adenocarcinoma cell line (American Type Culture
Collection, Rockville, MD), was cultured in T-225 cm² flasks (Corning Costar Corp.,
Cambridge, MA) using Kaighn's Nutrient Mixture F-12 (Irvine Scientific, Santa Ana,
CA) containing 7% fetal bovine serum (FBS) (Hyclone, Logan, UT) and 1 mM
L-glutamine. The cell culture reagents were obtained from Gibco BRL Life Technologies
(Grand Island, NY) except where otherwise noted. Cells were maintained at 37°C in a
humidified cell incubator containing 5% CO2. At approximately 70% confluence, the
growth media was aspirated and cells were rinsed with D-PBS. Cells were harvested by
trypsinization, treated with trypan blue exclusion viability stain and counted using a
hemacytometer. Lidded 96-well microtiter culture plates (Becton Dickinson, Franklin
Lakes, NJ) were then seeded at 5 x 10* cells per well in a 200 µL media volume. Two
wells were left empty to allow the later addition of external process controls. Seeded
plates were incubated for 3 hours (37°C, 5% CO2, in a humidified cell incubator) to allow
for cell attachment prior to compound addition.

A set of 80 known drugs ("Killer Plate 1", from MicroSource Discovery
Systems, Inc., Gaylordsville, CT) and an actinomycin-D positive control were solubilized
in 100% DMSO (Sigma Chemical Co., St. Louis, MO) and diluted to 8x working
solutions with growth media prior to cell plate addition. Compounds from a chemical
library (in pooled format) and subsequent confirmation of individual compound activities
were analyzed at a final concentration of 2.5 µM in 0.25% DMSO. Positive and vehicle
control wells were maintained at 0.25% DMSO (v/v), which had no effect on cell growth
or gene targets. For dose-response analysis, compounds were plated in triplicate and
analyzed using eight concentrations (between 10 µM and 3.16 nM in 0.25% DMSO), as
prepared by serial dilution. After cell attachment was verified by phase contrast
microscopy, a 25 µL aliquot of media was removed from the cell plate and an equivalent
volume of compound working solution (8x) was introduced with mild trituration of the
well volume, using a MultiMek 96 pipetting station (Beckman Coulter, Fullerton, CA).
Cell plates were then returned to the incubator for a 24 hour exposure period.
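
The plating arithmetic in this paragraph can be checked with a brief calculation. Two assumptions are made here and should be read as such: that the dose-response series runs in half-log steps from a 10 µM top concentration (eight such steps end at 3.16 nM, which matches the stated lower bound), and that the exchanged aliquot is 25 µL of the 200 µL well volume. The function names are hypothetical.

```python
import math


def half_log_series(top, n_points):
    """Serial dilution in half-log10 steps: each step divides by sqrt(10)."""
    step = math.sqrt(10.0)
    return [top / step ** i for i in range(n_points)]


def final_fold(working_fold, exchanged_ul, well_ul):
    """Fold-concentration in the well after replacing exchanged_ul of
    medium with the same volume of a working_fold working solution."""
    return working_fold * exchanged_ul / well_ul


# Assumed 10 µM top concentration, expressed in nM.
series_nM = half_log_series(10_000.0, 8)
print([round(c, 2) for c in series_nM])

# 25 µL of 8x working solution exchanged into a 200 µL well.
print("final fold in well:", final_fold(8, 25, 200))
```

With these assumptions the 8x working solution lands at exactly 1x in the well, which is consistent with the working solutions being prepared at eight times the intended final concentration.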

Lysis buffer was prepared by adding 145 mM β-mercaptoethanol (Sigma
Chemical Co., St. Louis, MO) and external mRNA controls (to a final concentration of
500 fM) to RLT Lysis buffer (Qiagen, Valencia, CA). Two external mRNA controls
were used: 7.5 kb poly(A)-tailed RNA and 1.2 kb Kanamycin Positive Control, which
were treated with DNAse to ensure that no contaminating DNA was present. Following a
24 hour incubation period, cell media was aspirated from all wells using an EL-404 plate
washer (BioTek Instruments, Winooski, VT). Lysis buffer (100 µL) was pipetted into each
well containing cells. Plates were then mixed on an orbital shaker (Labline, Melrose
Park, IL) for 15 seconds. Adhesive aluminum foil strips (E&K Scientific, Campbell, CA)
were used to seal the plates prior to frozen storage at -20°C.

For gene expression analysis, the cell lysates were thawed, and total RNA 
was purified in automated 96-well format using the Qiagen RNeasy 96 kit according to 
the manufacturer's recommended procedure. RNA concentrations were determined 
fluorometrically using RiboGreen reagent (Molecular Probes, Eugene, OR), adjusted in 
concentration, and aliquoted in 30 ng amounts into 96-well plates for assay. Total RNA
yields ranged from 0.45 to 1.8 µg per well depending on compound toxicity. RNA
samples were verified to be free from DNA contamination by running controls in which 
MMLV reverse transcriptase enzyme was omitted from the multiplex assay protocol. 
Purified RNA controls were included on each plate for process quality control and 
tracking. 

Primer Design 

Assay specificity was determined by utilizing unique primers for each
gene. Target-specific primers were designed to six target sequences and two reference
sequences (Table 1). Both forward TSPs and reverse TSPs were synthesized, having
sequences as delineated in Table 2. The 5' region of the target-specific sequences
includes sequences complementary to one of two universal sequences.

Table 1: Target-Specific Primers for Multiplexed Analysis of Gene
Expression in PC-3 cells

Target Sequence                          F-primer          R-primer              Size (bp)
beta-actin                               Sp61F             T7(P7)R3/R3pi (1:39)  117
cloning vector lambda EMBL3 SP6/T7
  fragment in GibcoBRL 7.5 kb mRNA       Sp6(P2)F2         T7(P7)R2              127
INAD                                     Sp6F1 (P2)        T7R1(P7)              147
hSPE                                     Sp6F2 (P2)        T7R2(P7)              157
survivin                                 Sp6F1(&F2) (P2)   T7R2(P7)              200
HNF3alpha                                Sp6F3(P2)         T7R3(P7)              215
GAPDH                                    Sp6F1 (P2)        T7R1 (P7)             237
EST                                      Sp6(P2)F4         T7(P7)R4              266
Hoxb 13                                  Sp6F1 (P2)        T7R1(&R2)(P7)         283
(KanR) aminoglycoside 3'-
  phosphotransferase                     Sp6(P2)(LP70)F2   T7(P7)R2              322

Table 2: Target-Specific Primer Sequences

Accession #      Primer                          Primer Name       Primer Sequence
X00351           β-actin forward                 Sp6.1F1           AGGTGACACTATAGAATAACCGATAAGGCCAACCGCGAGAAGATGA
X00351           β-actin reverse                 T77R3             GTACGACTCACTATAGGGATGGATAGCAACGTACATGGCTG
X00351           β-actin reverse phosphorylated  T77R3Pi           GTACGACTCACTATAGGGATGGATAGCAACGTACATGGCTG-Pi
U02426 fragment  7.5 kb forward                  Sp6(P2)F2         AGGTGACACTATAGAATAACTATGCCGGTATCAGCACX:
U02426 fragment  7.5 kb reverse                  T7(P7)R2          GTACGACTCACTATAGGGAGATGGCAGCX5TGATTTCAC
INAD             INAD forward                    Sp6F1(P2)         AGGTGACACTATAGAATAGTGACACGTCGCAGAATOAG
INAD             INAD reverse                    T7R1(P7)          GTACGACTCACTATAGGGATTGACDcrrcAGTracTTGA
hSPE             hSPE forward                    Sp6F2(P2)         AGGTGACACTATAGAATAGCTTCAITAGGTGGCTCAACA
hSPE             hSPE reverse                    T7R2(P7)          GTACGACTCACTATAGGGAGGCrCaocitgtcotagttc
Survivin         Survivin forward                Sp6F1(&F2)(P2)    AGGTGACACTATAGAATAgtcaoccxiaaccttcacatc
Survivin         Survivin reverse                T7R2(P7)          GTACGACTCACTATAGGGAcx:accCTGCAGCTCTATGAC
HNF3alpha        HNF3 alpha forward              Sp6F3(P2)         AGGTGACACTATAGAATAacttcaaogcatacgaacag
HNF3alpha        HNF3 alpha reverse              T7R3(P7)          GTACGACTCACTATAGGGAGGGAGCTAGGAAGTGTTTAG
M33197           GAPDH forward                   Sp6F1(P2)         AGGTGACACTATAGAATAAAGGTGAAGGTCGGAGTCAA
M33197           GAPDH reverse                   T7R1(P7)          GTACGACTCACTATAGGGAATGACAAGCTTCCCGTTCTC
M33197           GAPDH reverse phosphorylated    T7R1Pi(P7)        GTACGACTCACTATAGGGAATGACAAGCTTCCCGTTCTC-Pi
EST              EST forward                     Sp6(P2)F4         AGGTGACACTATAGAATAOCTCATCTGCCAACAATC
EST              EST reverse                     T7R4(P7)          GTACGACTCACTATAGGGACTAGCGGAAGCAAATTACAC
Hoxb 13          Hoxb 13 forward                 Sp6F1(P2)         AGGTGACACTATAGAATAGCGACArGACTCCCTGIT
Hoxb 13          Hoxb 13 reverse                 T7R1(&R2)(P7)     GTACGACTCACTATAGGGAAACTTGTTAGCCGCATACTC
J01839 (V00359)  KanR forward                    Sp6(P2)(LP70)F2   AGGTGACACTATAGAATAATCATCAGCATTGCATTCGATTCCTGTTTG
J01839 (V00359)  KanR reverse                    T7(P7)R2          GTACGACTCACTATAGGGAATTCCGACTCGTCCAACATC


Preparation of Primer Sequences 

Oligonucleotides were prepared using phosphoramidite methodology on
an ABI 394 DNA synthesizer using standard procedures and reagents, including dG(dmf)
FastPhosphoramidite (PE Biosystems 401183), 0.02M Iodine (PE Biosystems 401732) as
oxidant, and 0.25M 5-ethyl-1H-tetrazole (Glen Research 30-3140-52) as activator.
5'-biotinylated nucleotides were incorporated using commercially available amidite reagents
as described in the procedure below. Preparation of the cleavable primer sequences
involved the synthesis of a protected 3'-thiothymidine reagent (5'-O-Dimethoxytrityl-3'-
thiothymidine-3'-S-(2-cyanoethyl)-N,N-diisopropyl phosphorothioamidite). The
3'-thiothymidine nucleotide was incorporated in an automated fashion using the protected
phosphoramidite reagent described above. Column chromatography was carried out
under a positive pressure of argon gas. HPLC data were collected on a Hewlett-Packard
1100 series instrument at 260 nm.

In cases where mass spectrometric analysis was performed, one universal
primer of each target-specific primer pair was prepared having a biotin moiety
incorporated at the 5'-end, and a chemically-cleavable base, 3'-thiothymidine, at an
appropriate position. Cleavage of the amplified PCR product at the position of the
3'-thiothymidine reduces the measured DNA size, thus providing fragments suitable for
optimal mass spectral resolution and sensitivity. Furthermore, the cleavable bases could
be introduced in various positions within different universal primers used in different
multiplex reactions. The various cleaved positions yield a series of non-overlapping mass
spectral peaks suitable for multiplexed readout.

5'-biotin phosphoramidite (Glen Research 10-5950-90, 0.1M in anhydrous
acetonitrile) and Thio-T amidite (0.1M in anhydrous acetonitrile) were employed in the
synthesis of the universal primers. The synthesis was carried out using a 10-minute
coupling time for Biotin and two 5-minute couplings for Thio-T. The crude
oligonucleotide was deprotected in 28% aqueous NH3 at 55°C for two hours. Removal
of the solvent gave a white residue that was desalted on a NAP-10 column (Pharmacia
17-0854-01) with ddH2O. The product was analyzed by HPLC using a Supelcosil
LC-18-T column (Supelco 58971) and a gradient of 10 to 20% acetonitrile from 5 to 25 min.
at 1 mL per min. in 0.1M TEAA. Typical retention times were about 10 to 15 min., and
the purity of the product should exceed 80%.

For cases where the samples were analyzed on a fluorescence
electrophoretic device, a universal primer was synthesized that included a dye at the 5'
end. Fluorescent dye labeling of primers with 6-FAM was carried out on an automated
DNA synthesis device using 5'-fluorescein phosphoramidite (Glen Research, Sterling,
VA).

"Shifted" Universal Primers 

Greater assay throughput is achieved by mixing PCR products of the
original gene set (i.e. target sequences) with a "shifted" gene set so that signals from the
products of the two gene sets are interleaved. The "shifted" genes are separated from the
original genes by the same number of bases for each product in the multiplexed gene set.
The "shifted" genesets are generated by the addition of nucleotides to the labeled strand
of the universal primer to increase the length of the PCR products. Spacers are used to
separate the label from the specific portion of the universal primer sequence.
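
The interleaving idea can be illustrated by checking whether a given shift keeps the pooled peaks apart. The sketch below uses the product sizes listed in Table 1 and treats sizes as whole bases; it ignores any mobility effect of dyes or abasic spacers, and the function names and candidate shift values are chosen purely for illustration.

```python
def shifted_sizes(sizes, shift):
    """Product sizes of the 'shifted' gene set: every product is
    lengthened by the same number of bases."""
    return [s + shift for s in sizes]


def min_gap(original, shifted):
    """Smallest size difference between any two peaks in the pooled sample."""
    pooled = sorted(original + shifted)
    return min(b - a for a, b in zip(pooled, pooled[1:]))


# Product sizes (bp) listed in Table 1 for the PC-3 multiplex.
table1_sizes = [117, 127, 147, 157, 200, 215, 237, 266, 283, 322]

for shift in (3, 5, 10):
    gap = min_gap(table1_sizes, shifted_sizes(table1_sizes, shift))
    status = "collision" if gap == 0 else "resolved if the method separates this gap"
    print(f"shift of {shift} bases: closest peaks {gap} bases apart ({status})")
```

Because several products in this set already differ by exactly 10 bases, a 10-base shift would place a shifted peak on top of an original one, whereas smaller shifts keep every peak distinct. A check of this kind is one way to choose how many nucleotides to add to the universal primer.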

Shifted target universal primers were synthesized that contained a
nonnucleotide linker. The nonnucleotide linker used was an abasic nucleotide, dSpacer
phosphoramidite, 5'-dimethoxytrityl-1',2'-dideoxyribose-3'-cyanoethyl phosphoramidite
(Glen Research, Sterling, VA). The dSpacer was incorporated during automated DNA
synthesis on a DNA synthesis device using standard methods. After incorporation of the
dSpacer, between 1 and 10 thymidine bases were incorporated, and optionally a dye label
was also added.

For example, the universal primers used in a first series of multiplex
amplifications to generate an original geneset comprise a FAM-labeled Sp6 universal
sequence (forward direction) and an unlabeled T7 universal sequence (reverse direction):
Labeled Sp6: 5'-(FAM)-AGG TGA CAC TAT AGA ATA-3' (SEQ ID No. 23)
Non-labeled T7: 5'-GTA CGA CTC ACT ATA GGG A-3' (SEQ ID No. 24)
Alternatively, the T7 sequence can carry the fluorescent label while the
Sp6 sequence is unlabeled:

Non-labeled Sp6: 5'-AGG TGA CAC TAT AGA ATA-3' 
Labeled T7: 5'-(FAM)-GTA CGA CTC ACT ATA GGG A-3' 

In a second set of multiplex amplifications, universal primers containing
additional nucleotides are employed (dS = dSpacer phosphoramidite, available from Glen
Research, Sterling, VA), such that the molecular weight or mass of the resulting amplified
sequences is altered as compared to the first series of amplification reactions. Exemplary
universal primers for generation of the shifted geneset are:

Labeled Sp6: 5'-(FAM)-TITnTr-dS*'AGG TGA CAC TAT AGA ATA-3' 
Non-labeled T7: 5'-GTA CGA CTC ACT ATA GGG A-3'

As with the primers used in the previously-described amplification
reaction, the label can be carried on either of the universal sequences employed:

Non-labeled Sp6: 5'-AGG TGA CAC TAT AGA ATA-3'
Labeled T7: 5'-(FAM)-TnTnT-dS*-GTA CGA CTC ACT ATA GGG A-3' 

Reactions may also be performed separately for the same set of target
sequences using multiple dyes, which are then mixed to increase throughput. Labeled
universal primers are also "shifted" in size to avoid overlapping peaks and for improved
reproducibility. All reactions using multiple dyes were performed with the same
non-labeled T7 universal primers. Exemplary labeled Sp6 universal primers include:

FAM-labeled Sp6: 5'-(FAM)-AGG TGA CAC TAT AGA ATA-3'
HEX-labeled Sp6: 5'-(HEX)-TAG AGG TGA CAC TAT AGA ATA-3'
or 5'-(HEX)-TTT-(dS)-AGG TGA CAC TAT AGA ATA-3'
NED-labeled Sp6: 5'-(NED)-GAT TAG AGG TGA CAC TAT AGA ATA-3'

Additional primers can be designed by one of skill in the art. For example,
reactions may also be performed where one of the universal primers contains a cleavable
site and optionally a biotin, for specific solid-phase capture. Cleavable universal primers
are "shifted" in size once they are cleaved. As an example, all reactions using cleavable
Sp6 primers were performed with a non-labeled T7 universal primer. Exemplary labeled
Sp6 universal primers include:

Cleavable Sp6: 5'-(Biotin)-AGG TGA CAC TAthioT AGA ATA-3'

Amplification

The multiplex amplification step utilized solution-phase quantitative
multiplex RT-PCR amplification, and was coupled with multiplexed fluorescence or mass
spectrometric detection. Primer pairs (SEQ ID Nos. 1-22) for specific genes and controls
were designed using Primer-3 software (Whitehead Institute for Biomedical Research,
Cambridge, MA).

Reverse transcription to generate first strand cDNA was carried out using
30 ng of total RNA, 0.02 µM primers, 1 mM dNTPs, RNasin ribonuclease inhibitor (2.5
units, Promega, Madison, WI), and MMLV reverse transcriptase (10 units, Promega,
Madison, WI) at 42°C for 30 minutes. PCR amplifications were performed using 0.01 µM
gene-specific primers, 1 µM universal primers, 0.375 mM dNTPs (Promega, Madison,
WI), and AmpliTaq Gold polymerase (1 unit, Perkin Elmer, Foster City, CA) in the buffer
supplied with the enzyme. Thermal cycling was performed on a Perkin-Elmer GeneAmp
9700 for between 30 and 45 cycles using the following conditions: 94°C for 30 s, 55°C for
30 s, and 68°C for 1 minute. Multiplex PCR products were resolved using either the gel
electrophoresis or the capillary electrophoresis system for fluorescent readout when they
were all in the linear range of amplification, and were quantified by fluorescence intensity.
For fluorescent readout, one of the universal primer pairs used for PCR amplification was
labeled with the fluorescent dye 6-FAM utilizing 5'-fluorescein phosphoramidite (Glen
Research, Sterling, VA).

Gel electrophoresis 

The samples were prepared for multiplex fluorescent readout using a gel
electrophoresis system from The Gel Company (San Francisco, CA) by diluting the
RT-PCR products 1:4 in GE sample dilution buffer (a 1:3.3 dilution of fluorescently labeled
ladder (CXR Fluorescent Ladder, Promega, Madison, WI), 1:16 dilution of blue dextran,
and 1:1.6 dilution of deionized formamide). The fluorescent ladder was used as a gel
standard with every sample for normalization of the target PCR product sizes. After
denaturing the samples at 95°C for 5 minutes and cooling in an ice-water bath for 5 minutes,
0.5 µl of the diluted RT-PCR samples were loaded onto a 96-well linear loading tray and
transferred via absorption onto a 96-lane paper comb. The comb was then inserted onto
the gel and samples were allowed to run into the gel for approximately 35 seconds, after
which the comb was removed and discarded.
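
As a worked check of the GE sample dilution buffer recipe, the following Python sketch converts the stated dilutions into component volumes. It assumes that each "1:X dilution" means one part of the component in X parts of finished buffer; that reading, the 100 µl batch size, and the function name `component_volumes` are assumptions made for illustration.

```python
# Dilution factors of the GE sample dilution buffer components.
# Assumption: "1:X" is read as 1 part component in X parts final buffer.
GE_BUFFER = {
    "fluorescent ladder": 3.3,
    "blue dextran": 16.0,
    "deionized formamide": 1.6,
}


def component_volumes(total_ul, dilutions=GE_BUFFER):
    """Return the volume (in microlitres) of each component for a batch."""
    return {name: total_ul / factor for name, factor in dilutions.items()}


# Hypothetical 100 microlitre batch of buffer.
volumes = component_volumes(100.0)
fraction_accounted = sum(volumes.values()) / 100.0
```

Under this reading the three components account for about 99% of the buffer volume (roughly 30.3 µl ladder, 6.25 µl blue dextran, and 62.5 µl formamide per 100 µl), which suggests the three dilutions together describe essentially the whole buffer.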

Capillary electrophoresis

RT-PCR products for multiplex fluorescent readout using the capillary
electrophoresis system were diluted 1:10 in CE sample dilution buffer (1:5 dilution of
fluorescently labeled ladder in deionized formamide). Approximately 10 µl of the diluted
RT-PCR samples were placed in receptacles specific for the capillary electrophoresis
instrument and denatured at 95°C for 5 minutes. The samples were then cooled for 5
minutes in an ice-water bath prior to performing the capillary electrophoresis.

Mass spectroscopic analysis 

Subsequent to PCR amplification, samples were processed to prepare them
for mass spectrometric analysis. The processing steps were conducted in 384-well plates
on a robotic workdeck containing a magnetic platform to facilitate manipulation and
washing of magnetic beads.

Streptavidin-coated magnetic beads were added to each sample in binding
solution, 10 mM Tris, 20 mM ammonium acetate, 1 mM EDTA buffer, pH 7.2, and
incubated at room temperature for 20 minutes to allow binding of the biotinylated primer.
The sample tray was placed on a magnet platform of a robotic workstation to precipitate
the DNA bound to the beads. After the beads were pelleted, the supernatant was
removed, and the pellet was rinsed once with binding solution.

A denaturing solution of 0.1 N NaOH was used to rinse the pelleted beads
and to remove the non-biotinylated complementary strand. A second aliquot of the
denaturing solution was added, mixed above the pelleted beads, then incubated. The mixing
process was repeated four times, then the final supernatant was removed. The beads were
washed five times with a 20 mM ammonium acetate solution, then twice with deionized
water to remove residual salts. The beads were then resuspended in a cleavage solution
(0.1 mM silver nitrate) and the samples were incubated at 48°C for 15 minutes. The tray was
returned to the workstation to precipitate the beads, and the supernatant was transferred to
a fresh 384-well tray. A 70 mM DTT solution was added to samples in the
new tray to quench the reaction, and samples were dried in a vacuum centrifuge.

Approximately 0.5 µL of a matrix solution consisting of a 5:1 molar ratio
of 3-hydroxypicolinic acid (3-HPA) to picolinic acid (PA) was added to each well
containing dried sample. The matrix solution was prepared by mixing 18 µL of a freshly
prepared saturated 3-HPA solution (about 0.5 M) with 2 µL of 1 M PA. The redissolved
samples were then spotted (either manually or robotically) onto a mass spectrometer
sample plate, 0.5 µl, and allowed to crystallize for subsequent analysis.
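
The stated matrix composition can be checked with simple arithmetic, sketched here in Python. The sketch assumes the PA volume is 2 microlitres (the same unit as the 3-HPA volume) and takes the saturated 3-HPA solution at its nominal 0.5 M; the helper name `micromoles` is introduced only for this illustration.

```python
def micromoles(volume_ul, molarity):
    """Amount of solute in micromoles: microlitres x mol/L = micromoles."""
    return volume_ul * molarity


hpa_umol = micromoles(18.0, 0.5)  # saturated 3-HPA, "about 0.5 M"
pa_umol = micromoles(2.0, 1.0)    # 1 M PA; volume assumed to be in microlitres
molar_ratio = hpa_umol / pa_umol  # 3-HPA : PA
```

Under these assumptions the mixture contains 9 µmol of 3-HPA and 2 µmol of PA, a ratio of 4.5:1. Because the saturated 3-HPA concentration is only approximate, this is consistent with the nominal 5:1 molar ratio.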

For mass spectrometry readout, a linear time-of-flight (TOF) mass
spectrometer was employed, using an acceleration voltage of +20 kV; delay of +3.6 kV at
1.12 µsec; laser setting of 179 on the polarizer; mass gate of 5.84 µsec; and 400 shots.
Furthermore, a 2-point mass calibration with a 15-mer (4507.0 Da) and a 36-mer
(10998.2 Da) was utilized.
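
A two-point calibration of this kind can be sketched as follows. The sketch assumes the usual linear-TOF relationship in which the square root of the mass is linear in flight time for ions of equal charge; the flight times are hypothetical placeholders, and the function names are introduced here for illustration. It is not a description of the instrument's own calibration software.

```python
import math


def two_point_calibration(t1, m1, t2, m2):
    """Fit sqrt(m) = a * t + b through two calibrants.

    t1, t2: flight times of the calibrants (any consistent unit)
    m1, m2: known calibrant masses in Da
    """
    a = (math.sqrt(m2) - math.sqrt(m1)) / (t2 - t1)
    b = math.sqrt(m1) - a * t1
    return a, b


def mass_from_time(t, a, b):
    """Convert a flight time to a mass using the fitted constants."""
    return (a * t + b) ** 2


# Calibrant masses from the text; flight times (microseconds) are hypothetical.
T_15MER, T_36MER = 60.0, 93.0
a, b = two_point_calibration(T_15MER, 4507.0, T_36MER, 10998.2)
unknown_mass = mass_from_time(75.0, a, b)
```

By construction the two calibrants map back to 4507.0 Da and 10998.2 Da, and any peak arriving between them is assigned an intermediate mass.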

Quantitative levels of all genes in each sample, including target and
external spike control genes, were normalized to the internal controls, and are expressed
as ratios to the control ("housekeeping") genes GAPDH and β-actin.
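
The normalization step can be illustrated with a short Python sketch. The peak intensities below are invented, and reporting a separate ratio to each control gene is one plausible reading of the text rather than the documented procedure; `normalize_to_controls` is a hypothetical helper.

```python
CONTROL_GENES = ("GAPDH", "beta-actin")


def normalize_to_controls(peak_intensities, controls=CONTROL_GENES):
    """Express each non-control gene as a ratio to each control gene."""
    ratios = {}
    for gene, intensity in peak_intensities.items():
        if gene in controls:
            continue
        ratios[gene] = {c: intensity / peak_intensities[c] for c in controls}
    return ratios


# Hypothetical fluorescence intensities for one sample.
sample = {
    "GAPDH": 2000.0,
    "beta-actin": 4000.0,
    "Survivin": 500.0,
    "KanR": 1000.0,
}
ratios = normalize_to_controls(sample)
```

In this made-up sample, Survivin is reported at 0.25 relative to GAPDH and 0.125 relative to β-actin, so samples with different overall signal levels can be compared on a common scale.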

Validation of Primer Design

Multiplexed amplifications were validated to ensure that each primer pair
was specific for a particular target sequence and that there were no interactions among the
target sequences in the multiplex. This was accomplished by conducting drop-out
experiments, in which the multiplex amplification was run in the absence of a particular
primer pair. Additionally, the amplification reaction was validated by comparing the
results of primers in different multiplex environments, ensuring identical PCR product
sizes in each case. Furthermore, primers were also tested for efficiency by running the
multiplex assay on RNA samples known to express all of the targeted sequences.
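
The logic of a drop-out experiment can be sketched as follows. The gene names are taken from Table 5, but the peak sizes, the one-base tolerance, and both helper functions are assumptions made for illustration; the acceptance criteria actually used are not stated in the text.

```python
def dropout_designs(primer_pairs):
    """Yield (dropped_pair, remaining_pairs) for every single-pair drop-out."""
    for dropped in primer_pairs:
        yield dropped, [p for p in primer_pairs if p != dropped]


def dropout_passes(full_peaks, dropout_peaks, dropped_size, tolerance=1):
    """Check one drop-out run against the full multiplex.

    Passes when the dropped product is absent, every other product is
    still present, and no new peak has appeared.
    """
    def present(size, peaks):
        return any(abs(size - p) <= tolerance for p in peaks)

    expected = [p for p in full_peaks if abs(p - dropped_size) > tolerance]
    target_lost = not present(dropped_size, dropout_peaks)
    others_kept = all(present(p, dropout_peaks) for p in expected)
    no_new_peaks = all(present(p, full_peaks) for p in dropout_peaks)
    return target_lost and others_kept and no_new_peaks


pairs = ["beta-actin", "GAPDH", "Survivin"]
designs = list(dropout_designs(pairs))
```

A run such as `dropout_passes([150, 170, 190], [150, 190], 170)` passes, while a run in which the 170-base peak persists, or in which an unexpected peak appears, fails and points to a non-specific primer pair or an interaction within the multiplex.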

Example 14: Multiplex Strategies

Table 3 depicts exemplary strategies for multiplexing samples in the
methods of the present invention. Multiplex reactions A and B illustrate fundamental
multiplexing strategies for use in the methods of the present invention. In these assays,
all of the forward universal primers (UPfs) include the same universal sequence; in
addition, a single type of dye label is incorporated into the primers. In multiplex reaction
A, the reverse universal primers (UPrs) all have the same sequence as each other, but a
different sequence from the forward universal primers. The reverse universal primers do
not have an incorporated dye. In multiplex reaction B, all of the forward universal
primers and reverse universal primers contain the same sequence, and therefore both
strands of the products will have an incorporated dye. In the given example, at the end of
each type of reaction, the multiplexed samples contain 12 strands of amplified products
(two complementary strands from each of six templates), with dye incorporated in either
half (multiplex reaction A) or all (multiplex reaction B) of the strands. Because the dye is
the same for all targets, detection of individual products depends on their separation (in
this case, based on size).

Multiplex reaction C depicts an embodiment in which semi-universal
primers are used to shift the mobility of a subset of the amplification products during size
separation of otherwise overlapping peaks. Two forward universal primers are used for
designated subsets of targets. Both primers are labeled with the same dye, but one of
them additionally contains a friction group (i.e., an attached moiety that generates drag on
molecules as they migrate through a non-matrixed, liquid solution). See, for example,
Hubert and Slater (1995) Electrophoresis 16:2137-2142. In this example, the sizes of
products 1 and 4, 2 and 5, and 3 and 6 are the same or overlapping, but corresponding
peaks 1, 2, and 3, as well as 4, 5, and 6 are different sizes. The friction group will be
incorporated into products 4, 5, and 6, while leaving products 1, 2, and 3 unmodified. As
a result, the mobilities of products 4-6 will be retarded relative to 1-3, resolving these
otherwise overlapping sets into six separate peaks. The illustration represents the reverse
universal primers as all being the same sequence, but these primers may also comprise a
set of semi-universal primers.

Multiplex reaction D illustrates another embodiment of the components of
the multiplex reaction which can be employed in order to resolve overlapping signals. In
this reaction profile, two amplification products of the multiplex are the same size. The
mobility of one of the two overlapping signals can be shifted by adding a nucleic acid
sequence to one or both of the TSPs for one of the target sequences, lengthening its
amplification product. A similar effect is obtained by designing semi-universal primers
of different sizes.

Multiplex reaction E illustrates an important embodiment of the methods
of the present invention, which provides a mechanism by which the signals for multiple
species are resolved by separating other than by size. A set of semi-universal primers is
employed in the multiplex reaction; each UPf is labeled with one of a set of independent
labels, each of which can be detected uniquely. As with multiplex reaction C, the sizes of
products 1 and 4, 2 and 5, and 3 and 6 are taken as the same or overlapping, but peaks 1,
2, and 3, as well as 4, 5, and 6 are different sizes. Products 1-3 will be labeled with dye
number 1, and products 4-6 with dye number 2. The two sets of three products will still
have overlapping mobilities, but the fluorescent signals given by each of the two dyes can
now be separated by deconvolution of the emission spectral data. As in the previous
example, the UPrs can also be designed as semi-universal primers.
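
One simple form of spectral deconvolution is linear unmixing, sketched below for two dyes read in two detection channels. The text does not say which deconvolution method was used, so this is a stand-in technique; the response matrix `SPECTRA` is hypothetical and would in practice be measured from single-dye standards.

```python
def unmix_two_dyes(channel_1, channel_2, spectra):
    """Recover two dye amounts from two channel readings by Cramer's rule.

    spectra[i][j] is the response of detection channel i to one unit of
    dye j (assumed known from single-dye calibration runs).
    """
    (a, b), (c, d) = spectra
    det = a * d - b * c
    if det == 0:
        raise ValueError("dye spectra are not independent")
    dye_1 = (channel_1 * d - b * channel_2) / det
    dye_2 = (a * channel_2 - c * channel_1) / det
    return dye_1, dye_2


# Hypothetical channel responses: each dye bleeds into the other channel.
SPECTRA = [[1.0, 0.25],
           [0.5, 1.0]]

# Readings for an overlapping peak holding 100 units of dye 1 and 40 of dye 2:
# channel 1 = 1.0*100 + 0.25*40 = 110, channel 2 = 0.5*100 + 1.0*40 = 90.
amounts = unmix_two_dyes(110.0, 90.0, SPECTRA)
```

Applied at every position along the separation axis, the same calculation splits a single overlapping trace into one trace per dye, which is what allows products 1-3 and 4-6 to be quantified even though their mobilities coincide.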

Multiplex reaction F illustrates a method for obtaining signals from a
greater number of unresolved species than the number of available dyes. Two dyes were
used in the multiplex illustration of multiplex reaction E, enabling resolution of two
overlapping signals. In the embodiment described in multiplex reaction F, the signals
from three unresolved products are obtained using only two dyes with three different UPf
primers. In this embodiment, the third signal is obtained by double-labeling the
amplification products of that target. Because the signal from this product is known to
contain an equivalent fluorescent signal from each of the two dyes, its signal can be
separated from the signals of the two singly-labeled products. This application requires
that the three types of products are not completely overlapping, which would make
deconvolution of their signals very difficult. Ideally, the signals from the two
singly-labeled species should not overlap, but some overlap can be resolved by signal
processing of the data. More complex combinations are obviously possible when more
than two dyes are used.
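
For a peak that is at least partly separated from its neighbours, the three label states can be told apart by the fraction of signal carried by each dye. The following Python sketch illustrates that idea; the 0.15 tolerance and the function name are assumptions, and the input values are taken to be dye amounts after spectral unmixing.

```python
def classify_by_dye_fraction(dye_1, dye_2, tolerance=0.15):
    """Assign a peak to a label state from its unmixed dye amounts."""
    total = dye_1 + dye_2
    if total <= 0:
        raise ValueError("no signal")
    fraction_1 = dye_1 / total
    if fraction_1 >= 1 - tolerance:
        return "dye #1 only"
    if fraction_1 <= tolerance:
        return "dye #2 only"
    if abs(fraction_1 - 0.5) <= tolerance:
        return "dyes #1 and #2 (50:50)"
    return "unassigned (overlapping peaks?)"


# Hypothetical unmixed dye amounts for three peaks.
labels = [
    classify_by_dye_fraction(100.0, 2.0),
    classify_by_dye_fraction(48.0, 52.0),
    classify_by_dye_fraction(3.0, 97.0),
]
```

A peak whose dye fraction falls between the expected states, for example 75:25, is left unassigned, which mirrors the requirement that the three product types not be completely overlapping.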

These six cases are provided for illustration of the more important
embodiments of multiplexing reactions described in this invention. To one skilled in the
art, many variations in multiplexing strategies are possible by combining separate
elements of these examples. In particular, combination strategies can be employed
making use of the separate forward and reverse universal primers, or the combinations of
target-specific and universal primers, or semi-universal primers. In all cases, the
selection of the particular TSP sequences for each target within a multiplex can be
performed carefully to select the size of each PCR product and ensure that each product
can be detected uniquely.

Optionally, the methods of the present invention include methods to
increase the number of samples simultaneously analyzed by pooling the products of
separate reactions. This strategy increases the throughput and reduces the cost of the
assay for situations in which the pooled products cannot be generated in the same reaction
(for example, when each separate reaction is already maximized in multiplexing
potential). For example, samples are pooled after the RT-PCR reaction is complete, and
prior to analysis and quantitation.

Table 3: Multiplexing Strategies for the RT-PCR

Example  UPf             UPf label        UPr             UPr label  Application

A        UPf 1-6 =       dye #1           UPr 1-6 =       none       Resolution of a simple multiplex
         sequence "A"                     sequence "B"               by size (two universal primers)

B        UPf 1-6 =       dye #1           UPr 1-6 =       dye #1     Resolution of a simple multiplex
         sequence "A"                     sequence "A"               by size (one universal primer)

C        UPf 1-3 set,    dye #1           UPr 1-6 =       none       Use semi-universal primers to
         sequence "A"                     sequence "B"               create resolution by affecting
         +                                                           mobility
         UPf 4-6 set,    dye #1 +
         sequence "A"    friction group                              Create resolution by size shifting
                                                                     (where amplification products 1-3
                                                                     have overlapping masses with
                                                                     products 4-6)

D        UPf 1-6 =       dye #1           UPr 1-6 =       none       Create resolution by shifting size
         sequence "A"                     sequence "B"               (TSP length was changed to shift
                                                                     the mass of its amplicon)

E        UPf 1-3 =       dye #1           UPr 1-6 =       none       Use semi-universal primers to
         sequence "A"                     sequence "C"               resolve by size & fluorescence
         UPf 4-6 set,    dye #2                                      (multiplexing with dyes)
         sequence "B"

F        UPf 1 =         dye #1           UPr 1-6 =       none       Increase dye multiplexing capacity
         sequence "A"                     sequence "D"               possible with a fixed number of
         +                                                           dyes
         UPf 2 =         dyes #1 and 2
         sequence "B"    (50:50)
         +
         UPf 3 =         dye #2
         sequence "C"


Example 15: Pooling of samples using interleaving genesets or multiple
dyes

RT-PCR samples for the same multiplexed reaction may be mixed at
appropriate ratios by combining either the original set of target sequences with the
"shifted" target sequence set, and/or by combining reactions with multiple dyes. These
mixed samples are then diluted in the appropriate sample dilution buffer and loaded onto
the gel or capillary electrophoresis system. Exemplary profiles of original and "shifted"
multiplex genesets are shown in Figure 4. Examples of profiles generated by multiplexed
amplification with different dyes using multiplex genesets are shown in Figure 5. Several
illustrations of pooling strategies are listed in Table 4, and described below.

Multiplex reaction G illustrates an embodiment of a fundamental pooling
strategy for use in the methods of the present invention. In this example, two separate
reactions (G1 and G2) comprise different multiplexes. The combined products of the two
separate reactions are resolvable by size. (For examples G through M, it is assumed for
illustration that all of the products of each separate multiplex are resolvable by size.) As
an example, each separate reaction may be performed with the same UPf primer, labeled
with the same chromophore. After the reaction, the samples are combined for analysis.
All of the individual signals from the two reactions are then resolved by size.

The embodiments provided in Cases H-L illustrate various ways of
resolving the same set of amplified sequences generated in separate reactions. Multiplex
reaction H illustrates the use of isotopic or chemical modification to generate shifts in the
masses of otherwise equivalent amplification products. For example, deuterated dNTPs
may be used to generate "heavy" amplification products (designated as sequence A in
reaction H2) in one reaction, while unmodified dNTPs are used in another (reaction H1).
The heavier deuterium isotopes of hydrogen that are incorporated in one set of reaction
products will generate a shift in the mass of each product relative to the equivalent
amplicon of the other reaction.

The embodiment illustrated with multiplex reaction I makes use of the
friction molecules described previously in multiplex reaction C. In multiplex reaction I,
two reactions (I1 and I2) of the same multiplex set are performed, the first with
unmodified UPf primers and the second with UPf primers containing a friction group.
Both primers are labeled with the same dye. After the reaction, samples are combined for
analysis. The friction group will be incorporated into all of the products of reaction I2.
As a result, the otherwise overlapping signals will be separated by the frictional drag of
one species relative to the other.

Multiplex reaction J provides a way for detecting duplicate multiplex sets
by a mass shift. In this embodiment, two UPf primers are used, one of which is shorter
(in reaction J1) than the other (reaction J2). Two separate reactions are conducted, each
using different universal primers. This will result in a duplicate signal pattern in which
one group is offset from the other by a fixed size. This size offset can also be
accomplished by using two UPf primers coupled with two UPr primers, and changing the
lengths of one pair of UPf and UPr primers by a lesser amount.

Figure 4 depicts exemplary detection profiles of original and "shifted"
multiplex genesets, as prepared by methods of the present invention. The position of the
signal along the X-axis generally correlates with number of nucleotides in the amplified
product, while the Y axis indicates intensity of fluorescent signal. Panel A represents
data as collected for an "original" geneset, while panel B depicts data for a "shifted"
geneset (for which, in this example, the amplified products appear to have a greater mass
or friction coefficient as compared to the unmodified amplification sequences). Panel C
presents the original and shifted genesets together, demonstrating the resolution
introduced into the products of the "shifted" amplification reaction.

Multiplex reaction K illustrates a pooling strategy based on a mass shift
between duplicate multiplex sets, just as with multiplex reaction J. In this illustration,
primers of the same sequence and length are used for both multiplexes. However, for one
of the reactions (K2), the UPf incorporates a site of cleavage between two nucleotides in
the extension product. (Thus, the label must be incorporated 3' to the cleavage site in
order for it to remain with the extension product.) After amplification is complete, the
products made with the modified primer are cleaved, removing a fixed number of
nucleotides from the 5' end of the labeled strand. Cleavage may be performed after
pooling of separate reactions. Cleavage sites can be situated in one of several positions in
a primer sequence, facilitating pooling of multiple reactions.

In the embodiment illustrated in multiplex reaction L, identical
multiplexed reactions are generated (reactions L1, L2 and L3). Rather than mixing the
reactions prior to loading on the separation platform, they are simply loaded individually,
but with time delays, in order to generate an offset in their relative positions in the
separation medium.

Multiplex reaction M illustrates the use of multiple labels, e.g., fluorescent
dyes, each of which can be uniquely detected. In this embodiment, three separate
reactions (M1, M2 and M3) are performed with a single UPf primer sequence that
carries one of three different labels. After the reaction, the three samples are combined
for analysis. Each particular target from each reaction will have the same size as those
from each of the other reactions. The triplicate sets of signals from the three reactions
will be resolved by deconvolution of the fluorescence data. Examples of profiles
generated by multiplexed amplification with different dyes using multiplex genesets are
shown in Figure 5. The position of the signal along the X-axis correlates with number of
nucleotides in the amplified product, while the Y axis indicates intensity of fluorescent
signal. Panel A = FAM-labeled products; panel B = HEX-labeled products; panel C =
NED-labeled products; and panel D = FAM, HEX, & NED-labeled products combined.
As with all other case illustrations, the UPr primers can be utilized in conjunction with the
UPf primers to design more complex strategies.

Table 4: Pooling Strategies for Analysis

Reaction  UPf               Label           UPr         Label   Application
          (product)                         (product)

G1        UPf 1-6           dye #1          UPr 1-6     none    Resolution by size.
          (seq A)
G2        UPf 7-12          dye #1          UPr 7-12    none
          (seq B)                           (seq B)

H1        UPf 1-6           dye #1          UPr 1-6     none    Separate reactions have relative
          (seq A)                           (seq B)             mobility shifts from use of different
H2        UPf 1-6           dye #1          UPr 1-6     none    isotopes.
          (seq A")                          (seq B)

I1        UPf 1-6           dye #1          UPr 1-6     none    Separate reactions have relative
          (seq A)                           (seq B)             mobility shifts resulting from the
I2        UPf 1-6           dye #1 +        UPr 1-6     none    friction group (Note: product masses
          (seq A)           friction group                      of the two reactions overlap)

J1        TSPf set #1       dye #1          UPr 1-6     none    Separate reactions have relative mass
          (seq A)                           (seq B)             offsets resulting from primer
J2        TSPf set #2       dye #1          UPr 7-12    none    length differences.
          (seq A + 5 bases)                 (seq B)

K1        UPf 1-6           dye #1          UPr 1-6     none    Separate reactions have relative
          (seq A)                           (seq B)             mobility shifts resulting from removal of
K2        UPf 1-6           dye #1          UPr 1-6     none    nucleotides by cleavage within the
          (seq A +                          (seq B)             primer.
          cleavage site)

L1        UPf 1-6           dye #1          UPr 1-6     none    Separate reactions have relative
          (seq A)                           (seq B)             mobility shifts resulting from
L2        UPf 1-6           dye #1          UPr 1-6     none    staggered sample loading on the
          (seq A)                           (seq B)             separation platform
L3        UPf 1-6           dye #1          UPr 1-6     none
          (seq A)                           (seq B)

M1        UPf 1-6           dye #1          UPr 1-6     none    Three separate reactions are pooled for
          (seq A)                           (seq B)             analysis. Resolution by size &
M2        UPf 1-6           dye #2          UPr 1-6     none    fluorescence (multiplexing with dyes).
          (seq A)                           (seq B)             (Note: product masses of the three
M3        UPf 1-6           dye #3          UPr 1-6     none    reactions overlap.)
          (seq A)                           (seq B)

Note: "Product" refers to the amplification product; product seq A" represents a
"heavy" version of seq A.


TABLE 5: PRIMER SEQUENCES

SEQ ID No.  Accession #      Primer                              Primer Name       Primer Sequence

1           X00351           beta-actin forward                  Sp6.1F1           AGGTGACACTATAGAATAACCGATAAGGCCAACCGCGAGAAGATGA
2           X00351           beta-actin reverse                  T77R3             GTACGACTCACTATAGGGATGGATAGCAACGTACATGGCTG
3           X00351           beta-actin reverse, phosphorylated  T77R3Pi           GTACGACTCACTATAGGGATGGATAGCAACGTACATGGCTGPi
4           U02426 fragment  7.5 kb forward                      Sp6(P2)F2         AGGTGACACTATAGAATAACTATGCCGGTATCAGCACC
5           U02426 fragment  7.5 kb reverse                      T7(P7)R2          GTACGACTCACTATAGGGAGATGGCAGCGTGATTTCAC
6           n/a              INAD forward                        Sp6F1 (P2)        AGGTGACACTATAGAATAGTGACACGTCGCAGAATGAG
7           n/a              INAD reverse                        T7R1 (P7)         GTACGACTCACTATAGGGATTGACCCTTCAGTTGCTTGA
8           n/a              hSPE forward                        Sp6F2(P2)         AGGTGACACTATAGAATAGCTTCATTAGGTGGCTCAACA
9           n/a              hSPE reverse                        T7R2(P7)          GTACGACTCACTATAGGGAGGCTCAGCTTGTCGTAGTTC
10          n/a              Survivin forward                    Sp6F1(&F2) (P2)   AGGTGACACTATAGAATAGTCAGCCCAACCTTCACATC
11          n/a              Survivin reverse                    T7R2(P7)          GTACGACTCACTATAGGGACCACCCTGCAGCTCTATGAC
12          n/a              HNF3alpha forward                   Sp6F3 (P2)        AGGTGACACTATAGAATAACTTCAAGGCATACGAACAG
13          n/a              HNF3alpha reverse                   T7R3(P7)          GTACGACTCACTATAGGGAGGGAGCTAGGAAGTGTTTAG
14          M33197           GAPDH forward                       Sp6F1(P2)         AGGTGACACTATAGAATAAAGGTGAAGGTCGGAGTCAA
15          M33197           GAPDH reverse                       T7R1(P7)          GTACGACTCACTATAGGGAATGACAAGCTTCCCGTTCTC
16          M33197           GAPDH reverse, phosphorylated       T7R1Pi (P7)       GTACGACTCACTATAGGGAATGACAAGCTTCCCGTTCTCPi
17          n/a              EST forward                         Sp6F4(P2)         ACjO X (j Aw A\-. 1 A i i 1 TCTGCCAACAATC
18          n/a              EST reverse                         T7R4(P7)          GT ACaj At- 1 v.^ AC 1 A I /\ovj\j/*v.- 1 rtvj CGGAAGCAAATTACAC
19          n/a              Hoxb 13 forward                     Sp6F1(P2)         AGGTGACACTATAGAATAGCGACATGACTCCCTGTT
20          n/a              Hoxb 13 reverse                     T7R1(&R2) (P7)    GTACGACTCACTATAGGGAAACTTGTTAGCCGCATACTC
21          J01839 (V00359)  KanR forward                        Sp6(P2)(LP70 JF2  AGGTGACACTATAGAATAATCATCAGCATTGCATTCGATTCCTGTTTG
22          J01839 (V00359)  KanR reverse                                          TACGACTCACTATAGGGAATTCCGACTCGTCCAACATC
23          n/a              Sp6                                 universal primer  AGGTGACACTATAGAATA
24          n/a              T7                                  universal primer  GTACGACTCACTATAGGGA
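
The primers in Table 5 share a common layout: a universal sequence (SEQ ID No. 23 or 24) at the 5' end followed by a gene-specific portion. The Python sketch below splits a primer into those two parts, using SEQ ID Nos. 4 and 5 as input. The function name `split_primer` is hypothetical and is introduced only to illustrate the layout.

```python
SP6_UNIVERSAL = "AGGTGACACTATAGAATA"   # SEQ ID No. 23
T7_UNIVERSAL = "GTACGACTCACTATAGGGA"   # SEQ ID No. 24


def split_primer(primer):
    """Return (universal_name, gene_specific_part) for a target-specific primer.

    The universal name is None when the primer starts with neither
    universal sequence.
    """
    for name, universal in (("Sp6", SP6_UNIVERSAL), ("T7", T7_UNIVERSAL)):
        if primer.startswith(universal):
            return name, primer[len(universal):]
    return None, primer


# SEQ ID Nos. 4 and 5: forward and reverse primers for the 7.5 kb fragment.
forward = "AGGTGACACTATAGAATAACTATGCCGGTATCAGCACC"
reverse = "GTACGACTCACTATAGGGAGATGGCAGCGTGATTTCAC"

forward_parts = split_primer(forward)
reverse_parts = split_primer(reverse)
```

For these two primers the gene-specific portions are 20 and 19 nucleotides long, and the forward and reverse primers carry the Sp6 and T7 universal sequences respectively.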



The cases described above are provided for illustrative purposes. One 
skilled in the art can envision other embodiments that would achieve the general purpose 
of increasing sample throughput during separation and data collection. 

While the foregoing invention has been described in some detail for
purposes of clarity and understanding, it will be clear to one skilled in the art from a
reading of this disclosure that various changes in form and detail can be made without
departing from the true scope of the present invention. For example, all the techniques
and compositions described above may be used in various combinations. All of the
compositions and/or methods disclosed and claimed herein can be made and executed
without undue experimentation in light of the present disclosure. While the compositions
and methods of this invention have been described in terms of preferred embodiments, it
will be apparent to those of skill in the art that variations may be applied to the
compositions and/or methods, and in the steps or in the sequence of steps of the method
described herein without departing from the concept, spirit and scope of the invention.
More specifically, it will be apparent that certain agents which are both chemically and
physiologically related may be substituted for the agents described herein while the same
or similar results would be achieved. All such similar substitutes and modifications
apparent to those skilled in the art are deemed to be within the spirit, scope and concept of
the invention as defined by the appended claims. All publications, patents, patent
applications, and/or other documents cited in this application are incorporated by
reference in their entirety for all purposes to the same extent as if each individual
publication, patent, patent application, and/or other document were individually indicated
to be incorporated by reference for all purposes.

WHAT IS CLAIMED IS:

1. A method for analyzing gene expression comprising:

a) obtaining a plurality of target sequences, wherein the plurality of target
sequences comprises cDNA;

b) multiplex amplifying said plurality of target sequences, wherein multiplex
amplifying comprises combining the plurality of target sequences, a plurality of
target-specific primers, and one or more universal primers, thereby producing a plurality of
amplification products;

c) separating one or more members of the plurality of amplification products;

d) detecting one or more members of the plurality of amplification products,
thereby generating a set of gene expression data;

e) storing the set of gene expression data in a database; and

f) performing a comparative analysis on the set of gene expression data,
thereby analyzing the gene expression.

2. The method of claim 1, wherein obtaining the target sequences comprises
performing reverse transcription of mRNA.

3. The method of claim 2, wherein the mRNA comprises mRNA derived from
cultured cells.

4. The method of claim 2, wherein said mRNA comprises mRNA derived from
cultured cells subjected to a specific treatment.

5. The method of claim 4, wherein said specific treatment comprises a chemical
exposure, an environmental stress, or an exposure to one or more viable organisms or
cells.

6. The metiiod of claim 1, wherein multiplex amplifying comprises 

25 simultimeously amplifying a pluiabty of cDNA in die same reaction mixture; wherein 
said pluraUty of target-specific primers comprises one or more target-specific primer 
pairs, each pair comprising a forward target-specific primer and a reverse target-specific 
primer, and wherein the one or more universal primers comprises one or more universal 
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5 primer pairs, each pair comprising a forward universal primer and a reverse universal 
primer. 

7. The method of claim 1, wherein said plurality of target sequences further
comprises one or more reference sequences, wherein a portion of the one or more
reference sequences is homologous to at least one member of the plurality of
target-specific primers.

8. The method of claim 7, wherein one or more of the reference sequences 
comprises sequences endogenously present in the cDNA. 

9. The method of claim 7, wherein one or more of the reference sequences 
comprises sequences exogenously added to the cDNA. 

10. The method of claim 1, wherein at least one member of the plurality of
target-specific primers or universal primers further comprises a modified nucleotide.

11. The method of claim 10, wherein the modified nucleotide prevents
amplification of one or more portions of the at least one member of the plurality of
target-specific primers or universal primers.

12. The method of claim 10, wherein the modified nucleotide comprises one or
more non-nucleotide linkers, alkyl chains, or abasic nucleotides.

13. The method of claim 1, wherein at least one member of the plurality of target- 
specific primers or universal primers further comprises a cleavable linker. 

14. The method of claim 1, wherein at least one universal primer further
comprises a label.

15. The method of claim 14, wherein the label comprises one or more of a
chromophore, a fluorophore, a dye, a releasable label, a mass label, an affinity label, a
friction moiety, a hydrophobic group, or an isotopic label.

16. The method of claim 1, wherein each member of the plurality of
target-specific primers comprises a first sequence that is derived from a target gene of
interest and positioned within a 3' region of the member, and a second sequence that is
complementary to the universal primer and positioned within a 5' region of the member.

17. The method of claim 1, wherein the one or more universal primers comprise
one or more semi-universal primers.

18. The method of claim 17, wherein the one or more semi-universal primers
comprise primers which are complementary to one or more forward target-specific
primers, one or more reverse target-specific primers, or a combination thereof.

19. The method of claim 18, wherein the one or more semi-universal primers
comprise a first semi-universal primer that is complementary to all of the one or more
forward target-specific primers, and a second semi-universal primer that is
complementary to all of the one or more reverse target-specific primers.

20. The method of claim 17, wherein each of the one or more semi-universal
primers comprises a unique label.

21. The method of claim 1, wherein multiplex amplifying comprises providing the 
universal primer in an excess concentration relative to the target-specific primer. 

22. The method of claim 21, wherein a universal primer:target-specific primer
concentration ratio ranges from about 5:1 to about 100:1.

23. The method of claim 21, wherein a universal primer:target-specific primer
concentration ratio is about 10:1.
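
The concentration ratios recited in the two preceding claims are simple arithmetic, and a short sketch may help when checking a reaction setup. The function names, the nanomolar units, and the hard cutoffs are assumptions for illustration; the claims say "about", so the bounds are not meant as strict limits.

```python
def primer_ratio(universal_nm: float, target_specific_nm: float) -> float:
    """Return the universal:target-specific primer concentration ratio.

    Both concentrations must be in the same unit (nM is assumed here).
    """
    if universal_nm <= 0 or target_specific_nm <= 0:
        raise ValueError("primer concentrations must be positive")
    return universal_nm / target_specific_nm


def within_recited_range(ratio: float, low: float = 5.0, high: float = 100.0) -> bool:
    """Check a ratio against the recited range of about 5:1 to about 100:1.

    The hard cutoffs are a simplification of the word "about".
    """
    return low <= ratio <= high


# Hypothetical setup: 500 nM universal primer, 50 nM target-specific primer.
ratio = primer_ratio(500.0, 50.0)
ok = within_recited_range(ratio)
```

With these hypothetical concentrations the ratio comes out at 10:1, the value recited in claim 23.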

24. The method of claim 1, wherein an annealing temperature of the universal
primer is higher than an annealing temperature of the target-specific primer.

25. The method of claim 1, wherein obtaining a plurality of target sequences 
comprises providing two or more target sequences having two or more target-specific 
primer annealing temperatures. 

26. The method of claim 1, wherein multiplex amplifying the cDNA comprises
amplifying target genes that have comparable expression levels.

27. The method of claim 1, wherein multiplex amplifying the cDNA comprises
attenuating an amplification of abundant target genes.

28. The method of claim 27, wherein attenuating the amplification of abundant 
target genes comprises using one or more modified target-specific primers. 

29. The method of claim 28, wherein the one or more modified target-specific
primers comprise a blocking group attached at a 3' end of the modified target-specific
primer.

30. The method of claim 28, wherein the one or more modified target-specific
primers comprise one or more abasic nucleotides or mismatch nucleotides.

31. The method of claim 28, wherein using one or more modified target-specific
primers comprises providing a mixture of the one or more modified target-specific
primers with one or more unmodified target-specific primers, at a ratio optimized for a
desired amount of attenuation.

32. The method of claim 28, wherein the one or more modified target-specific
primers comprise a blocking group attached at a 3' end of a reverse target-specific primer.

33. The method of claim 28, wherein the one or more modified target-specific
primers comprise primers having a phosphate group on the terminal 3'-hydroxyl of the
target-specific primer.

34. The method of claim 28, wherein the one or more modified target-specific
primers comprise primers having a nucleotide penultimate to the terminal 3'-nucleotide
and attached via a 3'-3' phosphodiester linkage.

35. The method of claim 1, wherein multiplex amplifying further comprises
altering the length of one or more of the universal primers or one or more of the plurality
of target-specific primers prior to combining.

36. The method of claim 35, wherein altering the length comprises adding
nucleotides to an end of a universal primer or a target-specific primer.

37. The method of claim 35, wherein altering the length comprises inserting
nucleotides within a universal primer or a target-specific primer.

38. The method of claim 37, wherein altering the length of a target-specific primer 
comprises inserting nucleotides between a universal sequence and a target-specific 
sequence of the target-specific primer. 

39. The method of claim 35, wherein altering the length comprises incorporating a 
non-nucleotide linker into a universal primer or a target-specific primer. 

40. The method of claim 35, wherein altering the length comprises cleaving the
one or more universal primers or the one or more target-specific primers.

41. The method of claim 35, wherein one or more of the universal primers or one
or more of the plurality of target-specific primers comprise semi-universal primers.

42. The method of claim 1, wherein the plurality of amplification products 
comprises a plurality of labels at predetermined molar ratios. 

43. The method of claim 42, wherein the plurality of labels is incorporated on a 
single oligonucleotide primer. 

44. The method of claim 42, wherein the plurality of labels is incorporated on a 
plurality of oligonucleotide primers. 

45. The method of claim 1, wherein separating the one or more members of the
plurality of amplification products comprises performing one or more size separation
techniques.

46. The method of claim 45, wherein separating the one or more members of the 
plurality of amplification products comprises performing mass spectrometry. 

47. The method of claim 45, wherein separating the one or more members of the
plurality of amplification products comprises employing an electrophoresis platform.

48. The method of claim 47, wherein the electrophoresis platform comprises one
or more of a capillary platform, a microcapillary platform, a microfluidics platform, an
agarose gel, an acrylamide gel, an agarose/acrylamide gel or a chromatographic platform.

49. The method of claim 45, wherein separating the one or more members of the
plurality of amplification products comprises performing HPLC or FPLC.

50. The method of claim 1, wherein separating the one or more members of the
plurality of amplification products comprises performing HPLC followed by mass
spectroscopy.

51. The method of claim 1, wherein detecting the one or more members of the
plurality of amplification products comprises measuring one or more inherent properties
of the amplification products.

52. The method of claim 51, wherein the one or more inherent properties comprise 
mass, light absorption, or an electrochemical property. 

53. The method of claim 1, wherein detecting the one or more members of the
plurality of amplification products comprises measuring the presence, absence, or
quantity of a labeled amplification product.

54. The method of claim 53, wherein the labeled amplification product comprises 
a singly labeled amplification product, a multiply-labeled amplification product, or a 
combination thereof. 

55. The method of claim 53, wherein detecting comprises resolving a first signal
from a singly labeled amplification product and a second signal from a multiply labeled
amplification product by deconvolution of the data.

56. The method of claim 53, wherein detecting comprises resolving a first signal
from a singly labeled amplification product and a second signal from a multiply labeled
amplification product by reciprocal subtraction of the first or second signal from an
overlapping signal.
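
One way to picture the subtraction recited in claim 56 is sketched below. The two-channel model is an assumption made purely for illustration and is not stated in the claims: channel A is taken to report both the singly labeled and the multiply labeled product, channel B is taken to report only the multiply labeled product, and `a_per_b` is a hypothetical, separately calibrated factor relating the two channels.

```python
def resolve_overlap(channel_a_total: float, channel_b: float, a_per_b: float = 1.0):
    """Split an overlapping channel-A signal into its two contributions.

    Assumed (hypothetical) model:
      channel_a_total = singly_labeled + a_per_b * channel_b
    where channel_b is produced only by the multiply labeled product.
    """
    multiply_in_a = a_per_b * channel_b          # multiply labeled share of channel A
    singly = channel_a_total - multiply_in_a     # remainder is the singly labeled product
    return max(singly, 0.0), multiply_in_a       # clamp small negative noise to zero


# Hypothetical peak: 150 units in channel A, 50 units in channel B.
first_signal, second_signal = resolve_overlap(150.0, 50.0, a_per_b=1.0)
```

The same subtraction could be run in the other direction if the singly labeled contribution were the one known independently.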

57. The method of claim 1, wherein performing the comparative analysis
comprises measuring a ratio of each target gene to each reference gene.

58. The method of claim 1, wherein one or more of the multiplex amplifying,
separating and detecting is performed in a high throughput format.
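
The ratio measurement recited in claim 57 (each target gene against each reference gene) can be sketched as follows. The gene names, the signal values, and the dictionary layout are hypothetical; the claims do not prescribe any data structure.

```python
def expression_ratios(target_signals: dict, reference_signals: dict) -> dict:
    """Return the ratio of every target signal to every reference signal.

    Keys of the result are (target_gene, reference_gene) pairs.
    """
    ratios = {}
    for target, t_signal in target_signals.items():
        for reference, r_signal in reference_signals.items():
            if r_signal <= 0:
                raise ValueError(f"reference signal for {reference} must be positive")
            ratios[(target, reference)] = t_signal / r_signal
    return ratios


# Hypothetical detector readings in arbitrary units.
targets = {"geneA": 200.0}
references = {"ref1": 100.0, "ref2": 50.0}
ratios = expression_ratios(targets, references)
```

Normalizing against more than one reference gene, as the claim wording allows, gives one ratio per target and reference pair rather than a single number per target.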

59. A method for analyzing gene expression comprising: 

a) obtaining cDNA from a plurality of samples for a plurality of target 

sequences; 

b) performing a plurality of multiplexed amplifications of the target
sequences, thereby producing a plurality of multiplexed amplification products;

c) pooling the plurality of multiplexed amplification products; 

d) separating the plurality of multiplexed amplification products;

e) detecting the plurality of multiplexed amplification products, thereby
generating a set of gene expression data;

f) storing the set of gene expression data in a database; and

g) performing a comparative analysis of the set of gene expression data. 

60. The method of claim 59, wherein performing the plurality of multiplexed
amplifications comprises combining the plurality of target sequences, one or more
target-specific primers, and one or more universal primers.

61. The method of claim 60, wherein at least one of the one or more universal
primers or one or more target-specific primers comprises a label.

62. The method of claim 61, wherein a first multiplexed amplification is
performed with a primer comprising a first label that produces a first signal, and a second
multiplexed amplification is performed with a primer comprising a second label that
produces a second signal, wherein the first and second signals are distinguishable from
one another.

63. The method of claim 62, wherein the first and second signals are
distinguishable by deconvolution of signals obtained from the plurality of multiplexed
amplification products.

64. The method of claim 61, wherein the first or second label comprises a
high-affinity intercalating dye.

65. The method of claim 60, wherein performing the plurality of amplifications of
the target sequences comprises using universal primers having two or more lengths, and
wherein detecting the plurality of multiplexed amplification products comprises
measuring one or more size shifts among the plurality of multiplexed amplification
products.

66. The method of claim 60, wherein performing the plurality of amplifications of
the target sequences comprises using target-specific primers having two or more lengths,
and wherein detecting the plurality of multiplexed amplification products comprises
measuring one or more size shifts among the plurality of multiplexed amplification
products.

67. The method of claim 66, wherein performing the plurality of amplifications of the
target sequences comprises using universal primers comprising one or more cleavage
sites, and wherein detecting the plurality of multiplexed amplification products comprises
measuring one or more size shifts among the plurality of multiplexed amplification
products.

68. The method of claim 60, wherein separating the plurality of multiplexed
amplification products comprises shifting the mobility of member amplification products
relative to one another.

69. The method of claim 68, wherein shifting the mobility comprises
incorporating a friction moiety into one or more of the universal primers, thereby creating
a reduction in mobility of the amplification products.

70. The method of claim 59, wherein separating comprises applying each set of
multiplex amplification products to a separation platform at different times.

71. The method of claim 59, wherein performing the plurality of amplifications
comprises performing a polymerase chain reaction, a transcription-based amplification, a
self-sustained sequence replication, a nucleic acid sequence based amplification, a ligase
chain reaction, a ligase detection reaction, a strand displacement amplification, a repair
chain reaction, a cyclic probe reaction, a rapid amplification of cDNA ends, an invader
assay, a solid phase assay, a solution phase assay, or a combination thereof.

72. The method of claim 71, wherein the solid phase assay comprises a bridge
amplification or rolling circle amplification.

73. The method of claim 59, wherein one or more of the performing, separating
and detecting is performed in a high throughput format.

74. The method of claim 73, wherein one or more of the performing, separating
and detecting steps is performed at a rate of about 1000 samples per hour.

75. A method for analyzing gene expression comprising:

a) obtaining cDNA from multiple samples;

b) amplifying a plurality of target sequences from the cDNA, thereby
producing a multiplex set of amplification products;

c) separating and detecting the amplification products using a high
throughput platform, wherein detecting generates a set of gene expression data; and

d) storing the set of gene expression data in a database; and 

e) performing a comparative analysis of the set of gene expression data. 

76. The method of claim 75, wherein amplifying the plurality of target sequences
comprises using one or more universal primers.

77. The method of claim 75, wherein amplifying the plurality of target sequences 
comprises using one or more target-specific primers. 

78. The method of claim 77, wherein the one or more universal primers or the one 
or more target-specific primers comprise one or more non-nucleotide linkers. 

79. The method of claim 75, wherein separating and detecting the amplification
products comprises performing mass spectrometry, polyacrylamide gel electrophoresis,
HPLC, capillary electrophoresis, microcapillary electrophoresis, or a combination thereof.

80. The method of claim 75, wherein separating and detecting the amplification 
products is performed using microfluidic devices. 

81. The method of claim 75, wherein the high throughput platform comprises an
HPLC for separating the amplification products and a mass spectrometer for detecting the
amplification products.

82. The method of claim 75, wherein the high throughput platform comprises one
or more miniaturized scale platforms.



83. The method of claim 75, wherein one or more of the amplifying, separating 
and detecting steps is performed at a rate of about 100 samples per hour to about 5,000 
samples per hour. 

84. The method of claim 75, wherein one or more of the amplifying, separating 
and detecting steps is performed at a rate of about 1000 samples per hour. 

85. The method of claim 75, wherein amplifying the plurality of target sequences
comprises performing one or more of a polymerase chain reaction, a transcription-based
amplification, a self-sustained sequence replication, a nucleic acid sequence based
amplification, a ligase chain reaction, a ligase detection reaction, a strand displacement
amplification, a repair chain reaction, a cyclic probe reaction, a rapid amplification of
cDNA ends, an invader assay, a solution phase amplification assay, or a solid phase
amplification assay.

86. The method of claim 85, wherein the solid phase amplification assay
comprises a bridge amplification or rolling circle amplification.

87. A pool of amplification products prepared by the method of claim 1. 

88. A pool of amplification products prepared by the method of claim 59. 

89. A pool of amplification products prepared by the method of claim 75. 

90. A system for analyzing gene expression, the system comprising: 

a) an amplification module for producing a plurality of amplification
products from a pool of target sequences, the amplification module comprising at least
one pair of universal primers and at least one pair of target-specific primers;

b) a detection module for detecting one or more members of the plurality of
amplification products, wherein the detection module detects a presence, absence, or
quantity of the one or more members, and generates a set of gene expression data
comprising a plurality of data points; and

c) an analyzing module in operational communication with the detection
module, the analyzing module comprising a computer or computer-readable medium
comprising one or more logical instructions which organize the plurality of data points
into a database and one or more logical instructions which analyze the plurality of data
points.

91. The system of claim 90, wherein one or more of the amplification module, the
detection module, and the analyzing module comprise a high throughput system.

92. The system of claim 90, wherein the at least one pair of universal primers or 
the at least one pair of target-specific primers comprise one or more abasic nucleotides. 

93. The system of claim 90, wherein the amplification module comprises a unique
pair of universal primers for each target sequence.

94. The system of claim 90, wherein the amplification module comprises
components to perform a polymerase chain reaction, a transcription-based amplification,
a self-sustained sequence replication, a nucleic acid sequence based amplification, a
ligase chain reaction, a ligase detection reaction, a strand displacement amplification, a
repair chain reaction, a cyclic probe reaction, a rapid amplification of cDNA ends, an
invader assay, a solid phase amplification reaction, a solution phase amplification
reaction, or a combination thereof.

95. The system of claim 90, wherein the detection module comprises a mass 
spectrometer. 

96. The system of claim 90, wherein the detection module comprises an
electrophoretic device.

97. The system of claim 90, wherein the one or more logical instructions for
analyzing the plurality of data points comprises software for generating a graphical
representation of the plurality of data points.

98. The system of claim 90, wherein the one or more logical instructions which
analyze the plurality of data points are embodied in system software which performs
combinatorial analysis on the plurality of data points.

99. The system of claim 90, wherein the one or more logical instructions for
analyzing the plurality of data points comprises software for performing difference
analysis upon the plurality of data points.

100. The system of claim 90, the analyzing module further comprising an
output file.

101. A composition for preparing a plurality of amplification products
from a plurality of mRNA target sequences, the composition comprising:
one or more pairs of universal primers; and 

one or more pairs of target-specific primers, wherein the target-specific primers 
comprise one or more regions complementary to the one or more pairs of universal 
primers and one or more regions complementary to one or more target mRNA sequences. 

102. The composition of claim 101, wherein one or more members of the
one or more pairs of universal primers or one or more pairs of target-specific primers
comprises a non-nucleotide linkage.

103. The composition of claim 101, wherein one or more members of the
one or more pairs of universal primers or one or more pairs of target-specific primers
comprise one or more cleavable nucleotides.

104. A kit for obtaining a multiplex set of amplification products of target
genes and reference-genes, the kit comprising:

a) at least one pair of universal primers; 

b) at least one pair of target-specific primers; 

c) at least one pair of reference gene-specific primers; and 

d) one or more amplification reaction enzymes, reagents, or buffers. 

105. The kit of claim 83, further comprising:

e) software for storing and analyzing data obtained from the amplification 
reactions. 

106. The kit of claim 83, wherein the universal-primers comprises labeled
primers.

107. The kit of claim 83, wherein the simultaneous amplification of the
reference-genes allows quantitation of the amplification products.

108. Use of the kit of claim 104 for practicing a method selected from any 
one of claims 1 through 86. 

109. Use of a composition or kit selected from any one of the
compositions or kits of claims 101 to 107 for practicing a method selected from any one of
claims 1 to 86.

110. An assay utilizing a use set forth in claim 109.

39, 45-46, 48, 50-53, 58-60, 65-68, 71, 73-75, 78-79, 81-85, 87-91, 95, 102-103

1-104, 107-110 



Form PCT/ISA/210 (continuation of second sheet) (July 1998)



INTERNATIONAL SEARCH REPORT

International application No.
PCT/US01/02757




Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet)

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following
reasons:

1. Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

2. Claims Nos.:
because they relate to parts of the international application that do not comply with the prescribed requirements to
such an extent that no meaningful international search can be carried out, specifically:
Please See Continuation Sheet

3. Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule
6.4(a).


Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:

1. □ As all required additional search fees were timely paid by the applicant, this international search report covers all
searchable claims.

2. □ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite
payment of any additional fee.

3. □ As only some of the required additional search fees were timely paid by the applicant, this international search
report covers only those claims for which fees were paid, specifically claims Nos.:

4. □ No required additional search fees were timely paid by the applicant. Consequently, this international search report
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Remark on Protest □ The additional search fees were accompanied by the applicant's protest.

□ No protest accompanied the payment of additional search fees.


Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998)






Continuation of Box I Reason 2: Claims 105-107 recite a kit but are dependent on claim 83, which is drawn to a method for
analyzing gene expression. As the limitations of claims 105-107 do not make sense when applied to claim 83, no meaningful
search could be carried out for claims 105-107.



Continuation of B. FIELDS SEARCHED Item 3: EAST, STN: Biosis, HCAPlus; terms: multiplex, amplif$8, primer,
target, universal, linker, abasic, electrophoresis, spectrometry, gene near2 expression



Form PCT/ISA/210 (extra sheet) (July 1998)



